Programming Assignment
Named Entity Recognition (NER) with Recurrent Neural Network (RNN)
Build a recurrent neural network for Named Entity Recognition (NER) on the CoNLL 2003 dataset. Your task is to classify words into 10 different classes: <pad>, O, B-ORG, B-PER, B-LOC, B-MISC, I-ORG, I-PER, I-LOC, I-MISC. We are identifying whether words are part of a phrase referring to an organization, person, location, or miscellaneous entity. B indicates that the word is at the beginning of the phrase, I indicates that the word is inside the phrase but not the first word, and O indicates that it is outside any phrase (does not belong to one).
Data: You can find the training, validation, and test sets on Blackboard. You will build the model and tune parameters using the training and validation data, and evaluate the final model (after all development and tuning) on the test data.
Pre-processing: Read the complete data. The first column contains the words to be classified, and the last column shows the gold-standard tag for each word. Lower-case capitalized words (i.e., words that start with a capital letter) but not all-caps words (e.g., USA). Do not remove stopwords. The data is already separated by sentence and tokenized, so do not use other tools to tokenize it for this task. Separate the data by sentence. Once you know the maximum sentence length in the data, append 0s at the end of shorter sentences to make them match this max length. Set the tag for the 0s to <pad>.
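The pre-processing steps above can be sketched as follows. This is a minimal illustration, not the required implementation: the inline `sample` stands in for the Blackboard files, and the padding token/tag names (`"0"`, `"<pad>"`) follow the conventions described above.

```python
def read_conll(lines):
    """Split CoNLL-format lines into sentences of (word, tag) pairs."""
    sentences, current = [], []
    for line in lines:
        line = line.strip()
        if not line:                      # blank line ends a sentence
            if current:
                sentences.append(current)
                current = []
            continue
        cols = line.split()
        word, tag = cols[0], cols[-1]     # first column = word, last = gold tag
        # Lower-case capitalized words (e.g., "Soccer") but keep
        # all-caps words (e.g., "USA", "MEXICO") unchanged.
        if word[0].isupper() and not word.isupper():
            word = word.lower()
        current.append((word, tag))
    if current:
        sentences.append(current)
    return sentences

def pad_sentences(sentences, pad_word="0", pad_tag="<pad>"):
    """Append padding so every sentence matches the maximum length."""
    max_len = max(len(s) for s in sentences)
    return [s + [(pad_word, pad_tag)] * (max_len - len(s)) for s in sentences]

# Tiny inline sample in CoNLL column format (two sentences).
sample = [
    "Soccer NN B-NP O",
    "- : O O",
    "MEXICO NNP B-NP B-LOC",
    "",
    "GET VB B-VP O",
]
sents = pad_sentences(read_conll(sample))
```

In a real run you would replace `sample` with the lines of the training, validation, and test files, and additionally map words and tags to integer indices before feeding them to the network.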
Training: Build an RNN. Start with a vanilla RNN with one layer of 256 hidden units and a fully connected output layer with softmax as the activation function. Use the Adam optimizer and cross-entropy as the loss function, with a learning rate of 0.0001. Then try a bidirectional RNN with the same settings. Train with 2000 mini-batches per epoch. You may see convergence around 5000 epochs. You can change the RNN unit to LSTM or GRU in both the unidirectional and bidirectional architectures, and experiment with different learning rates and batch sizes. Select the system architecture and tune hyperparameters and parameters using the training and validation data. Finally, for the best architecture among the 6 above (RNN, bi-RNN, LSTM, bi-LSTM, GRU, bi-GRU — pick one!), make the necessary modifications to update the embeddings along with the rest of the network. This is your 7th and final system. Save your trained systems (i.e., models) using libraries such as callbacks.ModelCheckpoint(...) or model.save_weights(...).
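A Keras sketch of the architecture described above, under stated assumptions: `MAX_LEN`, `VOCAB_SIZE`, and the embedding dimension (100) are placeholder values you would determine from your own data, and the checkpoint file name is arbitrary. The embedding layer is trainable by default; for systems 1–6 you would freeze it (e.g., `trainable=False` with pre-trained weights), and for the 7th system set it trainable so the embeddings update with the rest of the network.

```python
import numpy as np
import tensorflow as tf

MAX_LEN = 113        # assumption: maximum sentence length found in the data
VOCAB_SIZE = 20000   # assumption: size of the word index
NUM_CLASSES = 10     # the 10 tags listed above, including the padding tag

def build_model(bidirectional=False, lr=1e-4):
    """Tagger: embeddings -> RNN(256) -> per-token softmax over the tags."""
    rnn = tf.keras.layers.SimpleRNN(256, return_sequences=True)
    if bidirectional:
        rnn = tf.keras.layers.Bidirectional(rnn)
    model = tf.keras.Sequential([
        tf.keras.layers.Embedding(VOCAB_SIZE, 100),  # trainable by default
        rnn,
        tf.keras.layers.Dense(NUM_CLASSES, activation="softmax"),
    ])
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=lr),
                  loss="categorical_crossentropy", metrics=["accuracy"])
    return model

model = build_model(bidirectional=True)

# Save the best weights during training, as the assignment suggests:
checkpoint = tf.keras.callbacks.ModelCheckpoint("best_model.keras",
                                                save_best_only=True)
# model.fit(X_train, y_train, validation_data=(X_val, y_val),
#           batch_size=..., epochs=..., callbacks=[checkpoint])

# Sanity check: per-token class distribution for each position.
out = model(np.zeros((2, MAX_LEN), dtype="int32"))
```

Swapping `SimpleRNN` for `tf.keras.layers.LSTM` or `tf.keras.layers.GRU` gives the other four architectures.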
Testing: Apply your trained models (7 in total) to the test data. Save your output and results in a .txt or a .log file. Results should be in the following format (Word Gold_Standard Prediction):

SOCCER O O
- O O
MEXICO B-LOC B-LOC
GET O O
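Writing the output file can be sketched as below. The helper name `write_results` and the file name `results.txt` are illustrative; `words`, `gold`, and `pred` are assumed to be aligned per-token lists with padding tokens already removed.

```python
def write_results(words, gold, pred, path="results.txt"):
    """Write one 'Word Gold_Standard Prediction' line per token."""
    with open(path, "w") as f:
        for w, g, p in zip(words, gold, pred):
            f.write(f"{w} {g} {p}\n")

# Example with the tokens from the format shown above:
write_results(["SOCCER", "-", "MEXICO", "GET"],
              ["O", "O", "B-LOC", "O"],
              ["O", "O", "B-LOC", "O"])
```

You would call this once per model (7 output files), then run the evaluation script on each file.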
Evaluation: Run conlleval.py on your output. Use the get_result function to print your accuracy in the log file.