Reference no: EM133270135
Exercise 1: Counting Poissons
The data/fish.csv is a data set of camping trips taken by 250 groups of people.
The campers may or may not have done some fishing during their trip.
If a group did some fishing, they would have caught zero or mor fish.
We want to estimate not only how many fish were caught (if there was fishing done by a camping group), but also the probability that the camping group caught any fish at all.
Here's info on the columns:
FISH_COUNT: The number of fish that were caught. This will be our dependent variable y.
LIVE_BAIT: A binary variable indicating whether live bait was used.
CAMPER: Whether the fishing group used a camper van.
PERSONS: Total number of people in the fishing group. Note that in some groups, none of them may have fished.
CHILDREN: The number of children in the camping group.
Your task is to predict the number of fish caught (FISH_COUNT) by a camping group based on the values of LIVE_BAIT, CAMPER, PERSONS and CHILDREN variables.
Use what we learned on count variables and zero-inflated datasets to achieve the best model you can.
Interpret the models you used to give an analysis of each feature's effect on the predicted fish caught.
N.B. Please appreciate the effort we went through to find a fish dataset for a count problem pun.