Information Retrieval from Job Posts based on K-means++ Clustering Algorithm
DOI:
https://doi.org/10.3126/jost.v5i1.93045Keywords:
Information Retrieval, K-means, K-means++, Elbow Method, Silhouette Analysis, Machine Learning, Discounted Cumulative GainAbstract
The research paper deals with two main sections: firstly, the experiment comparison between k-means and kmeans++ have been done using Elbow method and Silhouette method. Since, K-means++ is better than K-means, this research tries to justify that K-means++ has higher performance than K-means. Secondly, K-means++ has been used for Search and Information Retrieval system. Information Retrieval is an activity to obtain information system resources that are relevant to an information need from a collection of those resource. This research is useful to retrieve relevant documents that match a given query. When user add input such as industry type, job types, skills, and state, it will automatically calculate average and display the ranking. Subjective evaluation with DCG(Discounted Cumulative Gain) is done in order to measure ranking quality of information retrieval.