Reference no: EM132334614
Assignment -
Instructions: Need help with data analysis in Stata (with a do file and maps) for a master's thesis, including a detailed report with the discussion why some variables were used in the regression, regression approach, necessary tests and checks, maps, interpretation of all results etc.?
The topic of the master's thesis is "Innovative activity in East-Central and Eastern Europe during the early 20th century".
Some suggested regressions -
1) Regress GDP per capita on the number of patents per capita by country and 5 or 10-year period (panel data, probably fixed effects model). To avoid the problem of taking a log when the number of patents is zero, take a log of the number of patents plus e.g. 1 or 0, 1 was suggested. The GDP growth rate for these countries can probably be obtained. In case that GDP is not available for all countries and all periods, some proxy indicators (e.g. height) can be used.
The formula for height GDP per capita is: Ln(GDPC) = -10,094+0,105 * Height (from one paper).
Use three different dependent variables in that regression: 1) interpolated per capita GDP; 2) the height GDP per capita; 3) combination of interpolated per capita GDP for countries where at least one actual per capita GDP observation was recorded between 1900 and 1933, and in other cases height GDP per capita.
2. Another regression approach is to regress the total number of patents from each place on the university, railway, distance to Berlin, and patent structure (= country share of 12th class patents). When I showed this regression to the supervisor, it was okay for him, but I am not sure if such have done this regression correct, since I had ommited variable problem there.
Also obtained a general suggestion from the supervisor that per capita thing is important. Please do these regression as well.
Attachment:- Instructions & Data.rar