Reference no: EM132552223
ISIT219 Knowledge and Information Engineering Assignment - University of Wollongong, Australia
Business Case - YouTube is one of the largest video-sharing websites worldwide, with an estimated monthly viewership of 1 billion and serves as an important source for analyzing online user activity. In this assignment, we are taking YouTube as the main resource. There is a great potential of using YouTube data in a wide range of real -life applications. As a group of knowledge engineers, your team is required to use knowledge creation and representation techniques to analysis available YouTube data, for gaining an in -depth knowledge of user online activity. You will need to decide one topic that is of your interest, and clearly state that in your report.
Your tasks -
1. Some related topics include, but not limited to:
the influence analysis from video channels (tips: identify popular video channels and explore their influence in relation to type of video, likes/dislikes and received comments , etc. , over the time span)
sentiment analysis of comments (tips: find out the relationship between "likes" ("dislikes") and "description")
NLG (nature language generator) (tips: find out the relationship between "tags" and "description")
categorising videos based on comments (tips: find out the relationship between "category_id" and "description")
prediction of video popularity (tips: find out the relationship between "views" and "description, comment_count, category_id", etc)
You need to choose a YouTube-related topic, and state it explicitly in your report.
2. Apart from the available datasets, it is expected that you collect other necessary information and/ or existing case studies from academic resources (such as journal papers and books) to facilitate your research. This will be presented as the knowledge acquisition part in your project.
3. Various knowledge creation techniques can be employed including, but not limited to:
Classification (such as DT or ANN)
Clustering (such as SOM)
Association analysis (such as rule mining)
4. Finally, you need to write a report (maximum 2500 words) to elaborate on the following item:
Knowledge Acquisition or elicitation process
The techniques that you have employed for knowledge creation
- You need to justify the choice of techniques
- You need to provide at least 2 techniques to achieve full mark of knowledge creation section
Results and Discussions
- The information resource that you have gathered to assess the generated knowledge
- You can compare and contrast each knowledge category that is generated in the previous section with the existing documents or case studies from existing academic papers
- Minimum 2 pieces for each knowledge category are expected to achieve full mark
Explain and justify the possible inconsistencies in the gathered knowledge.
Note - Only write the classification of DT. Using rapidminer studio to data mining.
Attachment:- Knowledge and Information Engineering Assignment Files.rar