thesis-tagger

This project can be used to tag chunks of documents with topics. In this case the documents are job ads. The project consists of a node.js server connected to a MongoDB database and a simple web interface that accesses the server via a JSON API.

Prerequisites

Node.js
MongoDB
python
jupyter / ipython

Data preprocessing

The jupyter notebook 'extractjobs.ipynb' includes the necessary steps to perform the preprocessing on the data. It is assumed here that the data is in the format as described / used in this notebook. In essence this chops up the job ads into chunks and saves the data into csv files. These can then be imported in MongoDB as described in the notebook.

Server setup

Clone the project
run 'npm install' in the 'server' directory to get the necessary dependencies
run the server via 'node server.js' and access the interface via http://localhost:8082/jobad-tagger/

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
server		server
.gitignore		.gitignore
pre-processing.ipynb		pre-processing.ipynb
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

thesis-tagger

Prerequisites

Data preprocessing

Server setup

About

Uh oh!

Releases 3

Packages

Uh oh!

Languages

cle-ment/ma-thesis-crowdsource-text-tagger

Folders and files

Latest commit

History

Repository files navigation

thesis-tagger

Prerequisites

Data preprocessing

Server setup

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Languages

Packages