Skip to content

EggsyOnCode/search-engine-news

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

79 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Search Engine For News

Group Members

  • Farzan Saqib
  • Haroon Tahir
  • Rohaan Ahmad

Setting Up

About The Project

Simple Python based web server to answer search queries.
To set it up you need to install some python libraries (refer to setmeup.txt)
Run RUNSERVER.PY
Then cd into UI and enter the following code "npm run start"
Before searching anything please make sure you have created lexicon/forward index/ inverted index
Upload file optionis given on search page

Working

Everythig starts with document uploadation.
Soon after you upload document the lexicon,
forward & inverted index are created.

Lexicon

Lexicon is stored in dictionary/hashtable

Forward Index

After lexicon generation, inverted index in created.
As searching is done on inverted index, therefore 
while creating forward index we eliminated duplicates
by creating buckets on docID.

Inverted Index

For generating Inverted Index the dataset is divided, multiple(2) 
threads are created and temporary inverted index is created,
afterward we combine these temporay index. After combining temporary 
indexex are deleted. Information about hitlists canbe obtained from 
forward index.

Search

For searching enter the required word or phrase in text box,
that word or phrase(if phrase divide in small words) is 
passed to invereted index to get result.

Frontend

Frontend is mostly in HTML, CSS & React

Backend

Backend is in Python.

Based on Search Engine Implementation Large-Scale Hypertextual Web Search Engine

DSA.PRJECT.DEMO.mp4

About

A Search Engine built ontop of NELAGT-2002 dataset provided by Harvard.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published