Project about the time-series data trend prediction

Assignment Help JAVA Programming
Reference no: EM13968814

The specific problem in this project is about the time-series data trend prediction. The specific application scenario is in e-commerce. You are given a real dataset obtained from a real-world e-commerce application where there were 1000 products and 31490 customers (i.e., buyers) who bought these products. Of these 1000 products there are 100 key products (popular products). Also these 1000 products are in 15 categories. The specific data are given in the seven tables and the specific details of these tables are given below. The time window of this dataset is in 119 days with data documentation for each day. Hence, the time unit is one day where the timeline goes from the 0-th day to the 118-th day (17 weeks in total). Now you are asked to do the sale quantity prediction for the 100 key products for each day between the 119-th day and the 146-th day (four weeks).

• buyer_basic_info.txt: the basic attribute information of the buyers; in particular, the column names of this table are "buyer_id", "registration_time", "seller_level", "buyer_level", "age", and "gender". If we do not know the gender of a buyer, we set this buyer's gender attribute as -1.

• buyer_historical_category15_quantity.txt: the consumption quantities in the 15 categories for the buyers; in particular, the column names of this table are "buyer_id", "consumption quantity in the 1st category", ..., and "consumption quantity in the 15th category". The 15 categories are the ones of the products the customers bought in this dataset.

• buyer_historical_category15_money.txt: the consumption amounts in the 15 categories for the buyers; in particular, the column names of this table are "buyer_id", "consumption amount in the 1st category", ..., and "consumption amount in the 15th category".

• product_features.txt: the basic attribute information of the products; in particular, the column names of this table are "product_id", "attribute_1", "attribute_2", and "original price".

• Key_product_IDs.txt: the key product IDs

• trade_info_training.txt: the trade information between the key products and the buyers from the 0-th day to the 118-th day (17 weeks); in particular, the column names of this table are "product_id", "buyer_id", "trade_time", "trade_quantity", and "trade_price".

• product_distribution_training_set.txt: there are 120 columns, where the 1-st column shows the "product_id" and the 2-nd to the 120-th columns show the "quantities" of the key products from the 0-th day to the 118-th day; for example, the element at the 5-th row and the 10-th column in this table shows the quantity of the 5-th product at the 8-th day.

For grad students you are asked to do the prediction for the overall sale quantity of the 100 key products for each day of the four weeks (i.e., for each of the time window from the 119-th day to the 146-th day), and also for each key product for each day of the four weeks.

This phase is for the coding part of the project and concerns with the implementation of a time-series prediction method that you either take from the literature or you have developed by yourself as the result of your research in the first phase.

Please make sure to follow the format requirement as the text output file specified here. The file puts each prediction as one line where the first prediction is for the overall prediction and each subsequent prediction is for a key product. Each prediction output line begins with the key product id where the overall prediction id is 0. There is a space between the prediction and the key product id. Then there is a space between a pair of the predictions of two neighboring days. The prediction lines in the output file begin with the first line as the overall prediction where the product id is 0, and then the first key product prediction with the smallest product id (i.e., 1), all the way to the last line as the prediction for the last key product prediction (i.e., id = 964). Also note that for undergrad students your output file only has one line prediction just for the overall prediction beginning with the product id = 0.

What you need to turn in: you shall turn in a zipped package containing the source code of your implementation of the prediction method with appropriate comments and documentations in the code, a README file to explain how to compile and run your code under what specific environment, and a text file containing the output matrix following exactly the format requirement stated above.

Attachment:- Data.rar

Reference no: EM13968814

Questions Cloud

Full interpretation of the results : The results are analyzed via two- factor ANOVA, one factor being network and the other factor being news time. Complete the following ANOVA table for this study, and give a full interpretation of the results.
Problem regarding the mixed design and blocking : The results were analyzed using a mixed design and blocking, and the reported results were as follows.14
Which types of tides are found in the united states : The purpose of this activity is to learn how to graph tidal data from locations in the United States and to interpret your results.
Problem regarding the random-effects model : 1. Discuss the context in which can be analyzed by using a random-effects model. 2. What are the reasons for conducting a two-way analysis rather than two sep- arate one-way ANOVAs? Explain.
Project about the time-series data trend prediction : The coding part of the project and concerns with the implementation of a time-series prediction method that you either take from the literature or you have developed by yourself
Difference between monopoly and strong competition : Further analysis reported that for the difference between monopoly and limited competition, F (1, 272) = 67.9 and for the difference between monopoly and strong competition, F (1, 272) = 71.3.10
Identify the biological concepts from the course : Include a cover page containing the title of the assignment, the student's name, the professor's name, the course title, and the date. The cover page and the reference page are not included in the required assignment page length.
Degrees of freedom for factor : A further analysis of differences between every pair of groups was reported as all p 0.01.9 Interpret these reported ?ndings. What were the degrees of freedom for Factor, Error, and Total?
Create a document which defines and describes it : Create a document which defines and describes IT. You may use any resource; however, be sure to cite any resources you use

Reviews

Write a Review

JAVA Programming Questions & Answers

  Trying to write it java first and then pseudo code

A popular diversion in the United States, "word find" (or "word search") puzzles ask the player to find each of a given set of words in a square table filled with single letters. Trying to write it java first and then pseudo code

  Prepare a polynomial-time algorithm

Write a polynomial-time algorithm that, given an initial configuration (P_1, P_2, ..., P_k), decides if it is a winning configuration - Prepare a polynomial-time algorithm that decides if it is a winning configuration.

  Method named isallvowels that returns

Write a method named isAllVowels that returns whether a String consists entirely of vowels (a, e, i, o, or u, case-insensitively). If every character of the String is a vowel, your method should return true. If any character of the String is a non-vo..

  Describe the project requirements and other assumptions

Summarize the project requirements and other assumptions (i.e., budget, human resources, workload, environmental dependencies, etc.). Define a work breakdown structure and describe the methodology behind constructing one

  Initialize an array

Which will legally declare, construct, and initialize an array?

  Explain the conversion from class to basic type

Conversion from Class to Basic Type

  Write a restful web service demo example using spring mvc

Write a restful web service demo example using spring mvc

  Mean and standard deviation using using eclipse

Java programming to calculate Mean and standard deviation using Using Eclipse.

  Write a public static method writetokenstolines

Write a public static method called writeTokensToLines that will read an input file one token at a time using Scanner, and write the tokens to a file using a PrintWriter, one token per line, with the tokens numbered starting at 1.

  Develop java package comprising of class using encapsulation

Develop the Java package comprising of Class and Student with the given requirements. Encapsulation, Method to print student details alone.

  Write a one-class java program

Write a one-class Java program with at least one method (besides main) to determine if the data in your dataset (i.e., in data.txt) follows Benford's law.

  You need to train for 10 weeks

You are preparing for a marathon. In order to prepare, you need to train for 10 weeks, running an increasing number of miles per week, starting at running at least 2 miles your first week up to 26 miles by week 10.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd