Implement the system in any programming language

Assignment Help Python Programming

Reference no: EM132519292 , Length: 5 pages

Multimedia Security

Audio Fingerprinting Practical Work

Multimedia fingerprinting (also known as perceptual hashing) are a class of techniques used to produce short summaries of multimedia content. These summaries can then be used to search for these pieces of content in databases in an efficient way. Typical uses of perceptual hashing are reverse search, by which a piece of multimedia itself (instead of the file metadata, such as the name of the piece or its author's name) can be used to query a database; and copy detection to identify unauthorized copies of copyright-protected work or their distribution. An example of the first use would be the Shazam service1, which can be used to identify a song in a database from a short, often noisy, recording from a smartphone's microphone. An example of the second use would be Toutube's ContentID2. More details on media fingerprinting can be found in the slides for Unit 5 T5 - Copy Detection.

Task description

This practical work consists on implementing an audio fingerprinting system for reverse search database search, similar to the Shazam service, that is capable of identifying songs from short noisy samples. In the Practical Exercices section in the course moodle page, you will find a music library named Music Library part 1 and Music Library part 2, which contain 40 songs in total. The provided songs are wave files (.wav) with 2 audio channels, sample rate of 44100 samples per second and 16 bits per sample, making them 16 2 44100 t in size each, being t the song duration in seconds. Also, in the same moodle page section, you can find a selection of test samples, extracted from the songs in the Music Library, that you can use for testing. All samples have a random duration of 8 to 15 seconds and are divided in 4 categories:

Clean samples are extracted direclty from the songs in the Music Library and have no aditional processing.

Through a landline phone.

Noisy samples have been mixed with the noise samples also available in the moodle course page. The volume of the song with respect to the background noise is random for every sample (some of the samples are barely audible).

Noisy filtered samples combine the two previous processing actions, they should emulate noisy samples recorded with a bad quality micro- phone.

Your work is to implement two utilities, one that builds a fingerprint database from the songs directory and a second that identifyies a song from the database using one of the samples. You can implement the system in any programming language of your choice. The application design and archi- tecture are up to you. Note: The paper describing Shazam is in the moodle course page in the complementary reading section of Unit 5. Use that paper as your guide. However, the commands to build the fingerprints database and to identify a sample must be of the following form:
• Building the fingerprint database: builddb -i songs-folder -o database-file
• Identifying a sample: identify -d database-file -i sample
The first of the utilities does not need to return anything in particular (except for building the database). The second one should return the song name in the console. This does not mean that the sofware should be compiled. If you develop the programs in an interpeted language such as python or deliver a Java .jar file indicate so in your documentation. This means that the following examples are also valid:
• python builddb.py -i songs-folder -o database-file
• java -jar builddb.jar -i songs-folder -o database-file
• builddb.sh -i songs-folder -o database-file

But be completeley clear in how to execute the commands in the provided documentation.

Attachment:- Audio Fingerprinting Practical Work.zip

Reference no: EM132519292

Questions Cloud

Overall prices of the factors of electricity increase : If proportion of general sales tax (GST) on electricity is increased by the Govt. of Bangladesh.

Object-oriented design versus traditional approach : Compare the object-oriented approach to design to the traditional approach.

How you reconcile equality versus equity in public education : In the assigned readings and videos, the Heritage Foundation and Peter Sagal seem at odds in their respective positions toward the 14th Amendment.

Avc for the firm to operate at a loss and not shut down : Given this, show that price must exceed AVC for the firm to operate at a loss and not shut down?

Implement the system in any programming language : Implement two utilities, one that builds a fingerprint database from the songs directory and a second that identifyies a song from the database

Identify the legal issues presented by the classifications : Research the implications of equal protection for K-12 students within one of the following groups: Classifications based on English language learners.

Are there markets where you see a need for price controls : In our mixed economic system, who really decides prices? Producers or Consumers?

Integrated big data analytics : Which highlights how businesses have integrated Big Data Analytics with their Business Intelligence to gain dominance within their respective industry.

Can a oil producing firm make a economic profit : Can a oil producing firm make a economic profit in a long run after the oil price return to where before the corona virus pandemic?

User Account

All Pages