Resume Classification

An actor resume classification website, deployed via NLP and Multilabel Tech.
Team member: Lu Wang
Involved Tech: Python / NumPy / pandas / nltk / sklearn / genism / AWS EC2 / SQL / Spark / Flask
Link: Website Github

Overview

When a producer/director plans for new movie production, she usually arranges auditions for casting purposes. There will, of course, be many applicants to apply. Without reading the resumes or recommendations for all the applicants through, how could she quickly filter out those disqualified applicants? This system addresses this problem by automatically reading the resumes or recommendations through and recommend the producer/director about promising candidates to arrange an audition.

The initial thought was inspired by my ex-colleague, I hope to use AI, Data science to speed up resume selection progress. However, as individual, it is difficult to collect personal resume information. As an alternative, I think, for actors, some experience shown on wiki could be as part of their resume.

Methods

Conclusion and Further Steps

● One vs Rest transforms the multi-label problem into a set of binary classification problems. ● LDA can be used to measure performance on information retrieval. ● Each industry has its own requirements. Take the movie industry for example, in the initial casting stage, it is usually required to consider both the candidate’s professional experience and her visual appearance. Therefore, in order to improve the accuracy of screening, in addition to improving the resume analysis model, analyzing video, image materials submitted by the candidate is also necessary. ● It should be possible to apply these methods to other industries to help accelerate hiring and screening process.

Reference

https://towardsdatascience.com/topic-modeling-and-latent-dirichlet-allocation-in-python-9bf156893c24

https://radimrehurek.com/gensim/models/ldamodel.html

https://pyldavis.readthedocs.io/en/latest/modules/API.html#pyLDAvis.prepare

https://towardsdatascience.com/journey-to-the-center-of-multi-label-classification-384c40229bff