Build the model and tune parameters using training

Assignment Help Computer Engineering
Reference no: EM133417830

Programming Assignment

Named Entity Recognition (NER) with Recurrent Neural Network (RNN)

Build a recurrent neural network for Named Entity Recognition (NER) on CONLL 2003 dataset. Your task is to classify words into 10 different classes: , O, B-ORG, B-PER, B-LOC, B-MISC, IORG, I-PER, I-LOC, I-MISC. We are identifying whether words are part of a phrase referring to an organization, person, location, or miscellaneous. B indicates that word is at the beginning of the phrase, I indicates that the word is inside the phrase but not the first word, O indicates it is outside the phrase (does not belong to it).

Data: You can find training, test, and validation sets on Blackboard. You will build the model and tune parameters using training and validation data, and evaluate the final model (after all development and tuning) with the test data.

Pre-processing: Read the complete data. First column has the words to be classified, and last column shows the gold standard tag for each word. Lower case capitalized words (i.e., starts with a capital letter) but not all capital words (e.g., USA). Do not remove stopwords. Data is already separated by sentence and tokenized, so do not use different tools to tokenize for this task. Separate data by sentence. Once you know the maximum sentence length in the data, append 0s at the end of shorter sentences to make them match this max length. Set the tag for the 0s to .

Training: Build an RNN. Start with a vanilla RNN, with one layer of 256 hidden units, and a fully connected output layer using softmax as activation function. Use Adam optimizer, and cross-entropy for the loss function with learning rate 0.0001. Try a bidirectional RNN with the same settings. Train with 2000 mini batches per epoch. You may see convergence around 5000 epochs. You can change the RNN unit to LSTM or GRUs in both the unidirectional and bidirectional architectures, and experiment with different learning rates and batch sizes. Build a system architecture, as well as hyperparameter and parameter tuning using training and validation data. Finally, for the best architecture among the 6 (pick one!) above (RNN, bi-RNN, LSTM, bi-LSTM, GRU, bi-GRU), make the necessary modifications to update the embeddings along with the rest of the network. This is your 7th and final system. Save your trained systems (i.e., models) using libraries such as callbacks.ModelCheckpoint(...) or model.save_weights(..).

Testing: Apply your trained models (7 total) to test data. Save your output and results in a .txt or a .log file. Results should be in the following format: Word Gold_Standard Prediction SOCCER O O - O O MEXICO B-LOC B-LOC GET O O

Evaluation: Run conlleval.py on your output. Use the get_result function to print out your accuracy in the log file

Reference no: EM133417830

Questions Cloud

Identifying any potential need for your supervision : documenting a plan based on proven standards of care communicating information to colleagues and other relevant people working with the person reflecting on you
Newly formed emergency management department : You just got hired at a newly formed emergency management department in a small hospital in New Orleans.
What is explainable ai why is it significant : What is explainable AI why is it significant? how to build one is it an easy or difficult process to build an explainable AI model? Any current research
Discusses in the video ways that technology can cause stress : discusses in the video ways that technology can cause stress in school or What ways were mentioned in the video that can stress you out and what are ways you
Build the model and tune parameters using training : Build the model and tune parameters using training and validation data, and evaluate the final model (after all development and tuning) with the test data
About advances in racism in field sports discussed : Conduct basic research about advances in racism in field sports discussed. Discuss examples of direct and/or indirect racism that you witnessed in field sports.
What can a special education teacher do to prepare : What can a special education teacher do to prepare when writing the IEP document from the information noted on the MDT? What should the reader of an MDT report
Explain what a first impression is in your own words : Explain what a first impression is in your own words. Describe an experience you have had with a first impression in ANY context.
Identify two things you might suggest to help her with : Suppose you are a social worker at the outpatient leukemia clinic. You often get to know families very well because the typical treatment protocols last one

Reviews

Write a Review

Computer Engineering Questions & Answers

  Mathematics in computing

Binary search tree, and postorder and preorder traversal Determine the shortest path in Graph

  Ict governance

ICT is defined as the term of Information and communication technologies, it is diverse set of technical tools and resources used by the government agencies to communicate and produce, circulate, store, and manage all information.

  Implementation of memory management

Assignment covers the following eight topics and explore the implementation of memory management, processes and threads.

  Realize business and organizational data storage

Realize business and organizational data storage and fast access times are much more important than they have ever been. Compare and contrast magnetic tapes, magnetic disks, optical discs

  What is the protocol overhead

What are the advantages of using a compiled language over an interpreted one? Under what circumstances would you select to use an interpreted language?

  Implementation of memory management

Paper describes about memory management. How memory is used in executing programs and its critical support for applications.

  Define open and closed loop control systems

Define open and closed loop cotrol systems.Explain difference between time varying and time invariant control system wth suitable example.

  Prepare a proposal to deploy windows server

Prepare a proposal to deploy Windows Server onto an existing network based on the provided scenario.

  Security policy document project

Analyze security requirements and develop a security policy

  Write a procedure that produces independent stack objects

Write a procedure (make-stack) that produces independent stack objects, using a message-passing style, e.g.

  Define a suitable functional unit

Define a suitable functional unit for a comparative study between two different types of paint.

  Calculate yield to maturity and bond prices

Calculate yield to maturity (YTM) and bond prices

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd