This repository contains a simple search engine code in Indonesian using the TFIDF method. This code also uses the cosine similarity method to perform ranking. The dataset used is obtained from the Indonesian Wikipedia article.
-
Notifications
You must be signed in to change notification settings - Fork 0
Developed a simple search engine utilizing the TF-IDF algorithm for efficient information retrieval. The project involves text preprocessing, vectorization using TF-IDF, and ranking documents based on their relevance to the user's query. Designed to demonstrate fundamental concepts in NLP and Information Retrival
zuhdihanif/SimpleSearchEngineTFIDF
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Developed a simple search engine utilizing the TF-IDF algorithm for efficient information retrieval. The project involves text preprocessing, vectorization using TF-IDF, and ranking documents based on their relevance to the user's query. Designed to demonstrate fundamental concepts in NLP and Information Retrival
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published