Implement an object recognition method in python

Assignment Help Python Programming
Reference no: EM132851902

Visual Intelligence

Assignment - Practical work and technical report

Assignment description

The purpose of this assignment is to implement an object recognition method in Python or C/C++ to recognise different objects. Please consult your lecture notes for an overview of object recognition approaches, as well material from the literature. Methods may include geometric (shape) analysis and appearance-based methods. You are provided with a framework to install and use which provides access to Kinect data (see below). You may wish to use OpenCV (opencv.org), a library of programming functions, for real time computer vision. However, bear in mind that your own successful implementation of recognition will attract more marks than using library functions alone (although you may wish to use some of the OpenCV functions to support your implementation.)

The basic requirement is to be able to recognise different objects, presented as streamed depth data from a Microsoft Kinect sensor. To give you an idea of what is expected, consult the following video. You may optionally use RGB (colour) information in your object recognition process.

Note: you are not expected to generate a program that learns objects on the fly. All objects can be learnt offline. Your overall solution should include (1) implementation of an appropriate recognition algorithm; (2) performance evaluation of the recognition algorithm on the test dataset using a confusion matrix.

Project and dataset: You have been provided with:

A CMake C/C++ project (with Visual Studio example) and a Python project, which enable reading of Kinect data (depth, RGB, accelerometer) from a named directory.

Two datasets, one (labelled) for training, and one for testing

To generate the training/test datasets Depth, RGB and accelerometer data has been dumped from a real Kinect (presented with different objects) into a directory that can be loaded by Fakenect. Fakenect is a library which provides an interface to the data without requiring a real physical Kinect. It is not expected that accelerometer data will be used; Depth data should be your primary source of information for recognition.

Note: you are encouraged to use RGB data in addition (you will need to think about calibration and how to register the RGB image to depth) which will attract more marks

Undertaking the assignment: With support provided through the scheduled online interactive session and instructions on Blackboard, install the provided C/C++ or Python project on a lab PC (remotely) or on your own machine. Check that the simple demo of loading and getting access to Kinect data works. Download the datasets provided for the assignment (link to request data is on Blackboard). Start with depth data only and extract frames from the provided training data corresponding to each object, applying a suitably chosen threshold to the depth data. The extracted frames constitute your training dataset. Apply a similar process to the provided test data to generate your test dataset. Extract one or more selected features (e.g. shape and/or colour, based on what you have been taught, outside reading, use of OpenCV, etc.) from your training dataset and build a classifier. Evaluate your classifier on the test dataset and generate a confusion matrix.

Attachment:- Visual Intelligence.rar

Reference no: EM132851902

Questions Cloud

New equipment-to pay cash-rent-get loan or lease : EAGLE SOAR needs new, state of the art equipment. The management team has already identified a vendor for this equipment and a package
Construct anova having sources and df only in integers : In an experiment two factors will be investigated Spray for insecticides having 4 types and 6 types of fertilizers. The researcher wish to carry the experiment
Make a prediction for sales : Observations are taken on sales of a certain mountain bike in 30 sporting goods stores. The regression model was Y = total sales (thousands of dollars)
What is the minimum revenue of a reportable segment : Interest earned on loans to other industry segments, P400,000. Under the revenue test, what is the minimum revenue of a reportable segment?
Implement an object recognition method in python : Implement an object recognition method in Python or C/C++ to recognise different objects. Please consult your lecture notes for an overview of object
What are the reportable segments based on profit or loss : Segment W, P1,000,000; Segment X , (P2,000,000); Segment Y, P400,000; Segment Z, (P200,000). What are the reportable segments based on profit or loss?
Challenges associated with policy and legislative changes : How have legislation and policies changed in the past decade? What are some of the challenges associated with the policy and legislative changes?
How much should the land he recorded in the books : Current assessed values for the land and building, reapectively are P4500000 and 7,000,000. How much should the Land he recorded in the books?
What amount should cleopatra record the land acquired : What amount should Cleopatra record the land acquired in the exchange? Cteopatra Company owns a tract of land that it purchased in 2017

Reviews

Write a Review

Python Programming Questions & Answers

  Design and write a program that will allow a waitperson

Design and write a program that will allow a waitperson to take orders for each table, and calculate and display the per-person total, per-table total.

  Creating a web application for an on-line videogame shop

CE219 Assignment - Creating a 3-tier web application for an on-line videogame shop. The application must be implemented in Python

  Implement a program that starts by asking the user

Implement a program that starts by asking the user to enter a login id (i.e., a string). The program then checks whether the id entered by the user.

  Takes a directory containing files that record

Create a program called grading.py that takes a directory containing files that record how student's performed on their assignments and from these files determines each students grades in addition to the course statistics.

  Create a python script that opens a file and perform task

Create a python script that opens a .mbox file and perform the following: the script will open a file containing a blacklist of words and it will report whether

  Project title email spam filterabstractanalyze the emails

project title email spam filterabstractanalyze the emails and predict whether the mail is a spam or not a spam.to work

  Calculate and display the average of the 5 scores

Each gymnast in a competition receives scores from 5 judges. Write a Python program to do the following. calculate and display the average of the 5 scores.

  Write a single python expression

What is a file object and how is it different from a file name or the file's contents and Write a single Python expression that would evaluate to file extension

  Write the appropriate accessor and mutator methods for each

Write two class es named ProductionWorker and ShiftSupervisor that are subclass es of the Employee class.|

  Write program to calculate the length of the hypotenuse

CIS156 Python Programming Assignment - Triangle Calculator, South Mountain Community College, USA. Write program to calculate the length of the hypotenuse

  Compare and contrast results from two models

Compare and contrast results from TWO models designed to predict levels of potential sale price (fair value) of real estate assets in a specific area

  Extending caesar cypher into a full-fledged password saver

Extending the Caesar cypher into a full-fledged password saver - highly recommend you revert back to the original source code

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd