Implement Inverted Index and Query Mechanism for a Input set of documents using Hadoop.
$50-100 USD
Avslutat
Publicerad över fyra år sedan
$50-100 USD
Betalning vid leverans
Design a MapReduce-based algorithm to calculate a simple inverted index over the input set of files. Your map function should extract individual words from the input it is given, and output the word as the key and the current filename as the value. Write a Query program on top of your inverted file index in Hadoop, which will accept a user-specified word and return the IDs of the documents that contain that word.
Now create a full inverted index (i.e., a positional index), which maps words to their document IDs and positions in the documents. You will specify a word’s position in the document by using the byte offset of the line that it appears in. Your map function should extract individual words from the input it is given, and output the word as the key, and the current filename and byte offset as the value.
hi my name is imran working as Big Data Hadoop administration I will create cluster on AWS Azure GCP data centre to deploy your Hadoop cluster on it.
Hadoop administration related work I will do it for you.