DOI : 10.17577/IJERTCONV14IS010046- Open Access

- Authors : Rithesh Sudhakar, Ms Jayashree M
- Paper ID : IJERTCONV14IS010046
- Volume & Issue : Volume 14, Issue 01, Techprints 9.0
- Published (First Online) : 01-03-2026
- ISSN (Online) : 2278-0181
- Publisher Name : IJERT
- License:
This work is licensed under a Creative Commons Attribution 4.0 International License
AI-Powered Smart Hiring: A Step Toward Autonomous Screening and Recruitment
Rithesh Sudhakar
Department of Computer Applications St. Joseph Engineering College, Mangalore
Abstract – A clever AI- generated ltering system for resumes is presented in this study to eciently attack HR issues of the day. As operations grow, retaining brigades are decreasingly counting on scalable, effective and accurate aspirant webbing technologies. By using TF- IDF, cosine similarity, and other Natural Language Processing ( NLP) ways, the proposed system analyzes and compares proceeded data with job position criteria. Adding significant quantities of resumes is one of the system's crucial functions. Why? The system can recoup information about campaigners' education, experience, and chops by allowing HR to upload multiple resumes at the same time. The system assigns scores and grades to campaigners grounded on job conditions. It also allows for automatic dispatch communication with shortlisted campaigners andnon-selected campaigners. It reduces homemade work and implicit mortal bias while perfecting seeker quality through bettered overall effectiveness, thickness( by reducing the number of campaigners manually checked) and robotization of original webbing. In addition, enlisting HR labor force to assess and estimate AI suggestions promotes ethical hiring practices and maintains responsibility through a mortal- in the circle approach. also, The significance of balancing the use of robotization with mortal supervision in managing the ethical issues of AI reclamation, particularly in terms of scalability and ethics. By examining the use of AI in real- time HR processes, similar as seeker matching and capsule evaluation styles, this exploration provides both practical and specialized perceptivity. The research is also based on a human-centered approach to design. AI performs the repetitive tasks of parsing, ranking, and emailing but human recruiters are responsible for determining the final selection. Why? Rather than replacing human
Ms Jayashree M
Department of Computer Applications Assistant Professor
St. Joseph Engineering College, Mangalore
judgment, the system promotes a "human-in-the-loop" approach that permits users to review algorithmic recommendations, modify criteria, and manipulate rankings accordingly.
-
INTRODUCTION
in the face of a constantly shifting employment
landscape, HR departments are nding it dicult to keep up with the number of applications for job openings. As companies begin to receive hundreds to thousands of resumes for a single position, the traditional process of manually reviewing and shortlisting candidates is no longer eective or ecient. The initial resume screening process is a time-consuming and unfocused aspect of the hiring process, leading to bottlenecks, missed talent opportunities, and inconsistent selection outcomes. Human error, fatigue and unconscious bias can adversely affect the hiring decision making process through manual processes. Organizations in various sectors are
utilizing Artiï¬ cial Intelligence (AI) and automation to streamline recruitment workï¬ows and enhance candidate assessment, as a way to overcome these obstacles.
This study investigates the eï¬ectiveness and implementation of an artiï¬ cial intelligence-powered bulk resume ï¬ ltering system, designed to assist HR departments in handling large-scale recruitment initiatives. It is proposed that the system will use a combination of modern technologies, including Natural Language Processing (NLP), machine learning (ML) algorithms and models of semantic similarity to automatically extract data from resumes, classify it, and match it with job descriptions. It uses a range of techniques, including TF-IDF, cosine similarity scoring, Support Vector Machines (SVM), and Sentence-BERT to more precisely match candidates against jobs, rather than using traditional keyword-based ï¬ lters.
Additionally, this study is notable for its emphasis on bulk processing functionality…. This study suggests that recruiters can upload a significant number of resumes in PDF, DOCX, or plain text using the system, which is different from current commercial solutions that require single-resume parsing and costly third-party integrations. After upload, resumes are processed using NLP pipelines and then vectorized. The target job description is used to compute relevance scores by comparing these vectors. The scoring system automatically ranks candidates, resulting in a quicker process for selecting the most appropriate profiles.
The construction of this system utilizes
methodologies and frameworks that have been substantiated in past research and academic literature. In the paper "Resume Ranking for a Job Description Using Sentence-BERT," semantic embeddings are used to provide more comprehensive contextual understanding of both resume and job role, as noted in the study. Also, AIPowered Resume Screening: Advantages and Cons highlights the need to balance ethical safeguards with automation in job applications that are highly automated, particularly in hiring. Why? Building on these sources, we now offer support for not only TF-IDF and vector-based matching but also fairness-sensitive methods like fair-tf-idf normalization and UMAP dimensionality reduction to identify and correct demographic discrepancies in candidate evaluation.This approach is intended both experimentally and commercially viable measures of identity verification (IQPR). Research conducted by the UMBC NLP Lab indicates that these techniques prevent potential candidates from being discriminated against due to biased keyword frequency patterns or incomplete representation.
Beyond technical accuracy, the system also allows
end-toend automation of communication workflows. Once a candidate has been thoroughly evaluated, HR personnel can send pre-prepared but flexible emails to candidates they have shortlisted and those they rejected. Ensure that notifications are delivered on schedule, improve the candidate experience, and reduce administrative burden. The preservation of data privacy is achieved by utilizing lightweight AI models deployed in private
environments, while still adhering to enterprise- level data protection norms. This is done on a local level.
The research is also based on a human-centered approach to design. AI performs the repetitive tasks of parsing, ranking, and emailing but human recruiters are responsible for determining the final selection. Why? Rather than replacing human judgment, the system promotes a "human-in-the- loop" approach that permits users to review algorithmic recommendations, modify criteria, and manipulate rankings accordingly.
-
LITRETURE REVIEW
AI advancements have enabled the use of further intelligent and automated styles to renew filtering. Why? Traditional capsule netting styles were characterized by homemade review, which was time- consuming, error-prone, and inconsistent. Machine knowledge models similar as Support Vector Machines( SVM) and TF- IDF have come vastly used ways to deal with this issue. By using these models, they can prize vital textual details from resumes and classify them according to their part connection. By using a term frequence- rested analysis, contenders who use farther part-specific language are given advanced rankings. This is particularly true for seeker placements.
unshaped capsule textbook has been largely
converted into structured data through the use of Natural Language Processing( NLP). This is significant. NLP channels are generally used to prepare resumes for algorithmic analysis, which involve tasks like textbook nrmalization, tokenization( i.e, transferring out valid dispatches to the philanthropist), stopword jilting, and lemmatization processing.
Embed- BERT and other deep language models are being applied to embedding- rested capsule ranking, according trough more recent work.Unlike shallow models, judgment- BERT provides more accurate matching indeed when different language is used. The shortlisting of contenders with soft chops or interdisciplinary moxie is made further effective.
Fairness in AI- rested recovery has also come a major issue. exploration indicates that algorithms
can be inadvertently poisoned against certain traits or features by exercising the training data to distinguish unfairly rested on gender, race, or name. FairTF- IDF and other fairness-alive filtering ways have been developed to neutralize this. The current system. By retaining connection- rested ranking quality, these styles aim to minimize the impact of demographic signals on seeker scoring. Recess processing in large amounts is another significant advancement. aspirants can admit automatic feedback from AI- powered systems that dissect hundreds of resumes and classify them into different situations of connection, including largely applicable, fairly applicable or inapplicable. The reduction in recovery time is a significant enhancement, and the operation process can be more transparent and responsible by guaranteeing that all aspirants meet steady evaluation criteria.
-
METHODOLOGY
Introduction.
Chapter Four details the methodology used to develop a resume screening and bulk filter system using AI, with the goal of improving recruitment workflows from HR perspective.[]. By utilizing Natural Language Processing (NLP) and Machine Learning (ML) techniques, the system can process massive amounts of candidate resumes in real-time and match them with job descriptions with great accuracy. The method covers all stages of the process, including data collection and system assessment to ensure fairness, transparency,and effectiveness in hiring.'
System Architecture Overview.
-
Resume Upload and Preprocessing.
It is possible for HR personnel to upload several resumes in PDF or DOCX format at once. After obtaining the raw text from these documents, it performs a process of text cleaning (such as lowercasing, removing unnecessary characters, and lemmatization) to prepare the data for analysis.
-
Job Description Parsing.
After entering a job description, the HR retrieves relevant keywords and skills related to the role. All resumes can be compared against this benchmark.
-
Use of NLP Techniques to Extract Feature.
TF-IDF and other Natural Language Processing algorithms are utilized to transform textual data into numerical vectors. This translates the resumes and job descriptions into similar feature sets that can be processed by machine learning algorithms.
-
Job-Resume Matching Using Similarity Algorithms.
The similarity scores are calculated by the Cosine Similarity or Sentence-BERT embeddings._ The scores are used to determine whether a candidate's resume meets the job requirements and is further evaluated.
-
Shortlisting, Classification, and Auto- Notification.
Scores of resumes) are classified as highly relevant, moderate or irrelevant. By creating and sending personalized emails to candidates, the system has simplified communication between both shortlisted and rejected candidates.
Data Collection and Dataset Preparation.
Human resource managers can upload a significant number of resumes in PDF or DOCX formats to the system. The resumes are sourced from individuals applying for jobs or obtained from publicly available resume datasets. The addition of job descriptions or structured documents is done by HR managers. How?
-
Converting the DOCX/PDF to plain text using libraries like pdfplumber, python-docx, etc.
This procedure is used to extract legible text from resumes submitted in formats such as PDF and DOCX. Clean parsing of content for NLP processing is supported by Python libraries like pdfplumber and python-docx, which are used for PDFs and Word documents.
-
Eliminating noise (special characters, redundant white space formatting)
Consistent text analysis is achieved by removing unnecessary features like extra punctuation, tabs, and line breaks. This prevents errors in feature extraction and improves the precision of downstream models.
-
The use of spaCy and NLTK can be used to convert text into lowercase and remove stopwords.
The use of lowercase is used in all text to prevent the confusion between "Python" and "pyhthON.". Why? By using spaCy or NLTK, the stopwords like "the," "and," or "is" are eliminated to retain only the relevant information for comparison purposes.
-
Lemmatizing keywords for uniformity.
SpaCy is used to simplify words by reducing their base forms, such as "developing" being transformed into "development.". Machine learning models are able to interpret similar skills and experiences with accuracy during similarity matching. This is important.
Point birth Using NLP ways.
Semantic analysis with Natural Language Processing forms the abecedarian element of the methodology. The following ways are used
The cosine similarity score of resumes is high enough to nearly match the job description, hence they're classified as this. These individualities are most likely to meet the core qualifications and are screened for reclamation.
-
Relatively Applicable.
Job conditions are incompletely matched by resumes in this order, although some may be without specific crucial chops or gests . Secondary scrutiny or indispensable job positions are supposed applicable for these biographies.
-
Inapplicable.
The job conditions are n't well- matched by these resumes with low similarity scores. They could be rejected or prioritized by the system to save time and trouble for babe during the first round of webbing. Why?
Fairness- Aware Filtering.
-
Term frequence- frequence( TF- IDF)
Inverse Document
The AI system's fairness- apprehensive filtering prevents the bias towards certain campaigners
TF- IDF is employed to convert resumes and job descriptions into weighted numerical point vectors, which captures the significance of crucial terms in each document.
It gives further weight to important qualifications, chops, or job designations.
-
Vector Representation & Cosine Similarity. Cosine similarity is used to measure the degree of parity between resumes and job descriptions by using vectorization with TF- IDF. relating the seeker that matches for them is done by setting a threshold( e.g, 0.7).
-
Advanced Sentence Embeddings.
Extended performances can incorporate judgment- BERT embeddings to capture contextual meaning, which can enhance the understanding of expressions like working on React systems versus knowing how to use Reactor as exemplifications.
Resume Filtering and Bracket.
Cosine similarity scores are used to automatically rank and shortlist resumes.? Alternate styles involve the use of a trained Support Vector Machine( SVM) classifier to further classify resumes into
-
largely Applicable.
grounded on sensitive criteria similar as gender, race, or race. The FairTF- IDF approach modifies the significance of terms in resumes, performing in lesser emphasis on chops and qualifications rather than prejudiced patterns. UMAP is a fashion that can be employed to reduce and fantasize data confines, which helps identify and minimize retired bias in the point space. All of these styles, when combined together, aim toinsure that the selection process is ethical and unprejudiced in order to guarantee that all campaigners are named fairly.
Resume Scoring and seeker Profiling.
Every capsule is assessed for similarity and fields are included, similar as.
-
Name.
-
Contact Details.
-
Chops.
-
Education Level.
-
Experience Duration.
It keeps this profile on train to be used for future reference, visual filtering and analytics.
bus Dispatch announcement System.
Following the selection of resumes, the system sends automatic emails to
-
Listed campaigners( with information on interview process and follow- up)
-
Repelled campaigners( with a gracious response or status update)
Technologies and Tools Used.
What's the stylish way to use ReactJS for frontend development of an HR dashboard?
-
The backend of FastAPI for AI APIs.
-
NLP libraries like spaCy, NLTK, and sklearn are present in the request.
-
Machine Learning with scikit- learn( TF- IDF, SVM)
Resume Parsing python- docx, pdfplumber. Incorporating the HR system at the original position guarantees sequestration for data.
The Language Model may include LLMs similar as BERT, Senence, and BERT in the position they're tutored.
Evaluation Metrics.
The system is estimated using
-
delicacy
The delicacy of the system measures the number of resumes that are classified as applicable or inapplicable compared to factual prospects. A advanced degree of perfection indicates that the AI is picking the correct group of individualities most efficiently.
-
Precision/ Recall
Precision measures the number of suitable campaigners shortlisted, while recall scrutinizes how numerous good aspirants remained unnoticed.
-
-
-
RESULTS Performance Improvements
The proposed AI-powered resume filtering system demonstrated significant improvement in recruitment efficiency. Compared to traditional manual screening:
-
Resume shortlisting time was reduced by approximately 65%, as HR professionals no longer needed to review each resume manually.
-
Accuracy of candidate-job matching improved to 83% based on HR validation feedback, showing better alignment with job role requirements.
-
The bulk upload and auto-notification features enabled screening of up to 300 resumes in under 5 minutes, a significant leap in scalability.
These findings are consistent with studies like "AI-Powered Resume Screening and Categorization Using TF-IDF + SVM" and Resume Ranking Using Sentence-BERT, which also reported time-saving and enhanced precision through semantic matching and vector-based scoring techniques.
Resume Classification Results
The system classified resumes into three categories: Highly Relevant, Moderately Relevant, and Irrelevant, using cosine similarity and an SVM classifier trained on job-role-specific resume datasets. Sample test results showed:
-
28% resumes were marked Highly Relevant
-
44% as Moderately Relevant
-
28% as Irrelevant
These classifications allowed HR personnel to focus review efforts on top-tier candidates, leading to faster and better-informed shortlisting decisions.
HR Feedback
Qualitative feedback from internal HR users was positive. Respondents noted:
-
Reduced workload and cognitive fatigue
-
Faster response time to applicants
-
Improved objectivity and consistency in candidate evaluation
These align with the findings in AI-Powered Resume Screening: Benefits and Challenges which emphasizes reduction in recruiter bias and time savings.
Benefits Observed
-
Bulk screening with auto-ranking improved hiring scalability
-
NLP-driven filtering increased resume-job description alignment
-
Auto-email communication saved repetitive HR effort
-
Privacy was maintained as processing occurred on local systems
Challenges Observed During Testing
-
In some cases, resumes with creative formatting or unusual phrasing had lower similarity scores despite relevant skills
-
Fairness-aware filtering required additional processing time and tuning, and integrating techniques like fair-tf-idf proved complex
-
Contextual nuances in soft skills and cultural fit were harder to capture with algorithmic matching
These challenges are reflected in the research Fairness in AI-Driven Recruitment and the UMBC paper on demographic bias, which highlight the importance of transparency and the risk of algorithmic misrepresentation.
Time Efficiency & Scalability
-
Traditional Manual Screening Time: 24 minutes per resume (~5 hours for 100 resumes)
-
AI Filtering Time (including preprocessing): < 30 seconds for 100 resumes
-
Bulk Processing Capacity: Tested successfully with up to 1,000 resumes in a single batch without system slowdown
HR Usage Results
From internal feedback after system trials:
-
88% of HR users preferred the AI-filtered resume set over raw resume piles.
-
75% reported faster candidate communication and response workflows.
-
60% expressed concern over edge cases where skilled but unconventional resumes were missed, reinforcing the need for human validation (human-in-the-loop model).
Semantic Skill Matching Enhancement
To evaluate how well the system detects synonyms and semantically similar skills (e.g., "React.js" "JavaScript library", or "Node.js" "Backend"), a controlled vocabulary test was conducted using:
-
TF-IDF alone: 65% semantic match success
-
Sentence-BERT: 86% success
-
Human Agreement Baseline: 92%
This shows that semantic embeddings significantly outperform TF-IDF in interpreting candidate skills in real-world contexts, echoing findings from the paper Resume Ranking for a Job Description Using Sentence-BERT.
Bias & Fairness Testing
A fairness audit was conducted using synthetic resumes across demographic groups (male, female, ethnic name patterns):
-
Without FairTF-IDF: Male/female shortlist ratio: 64:36
-
With FairTF-IDF + UMAP integration: Ratio balanced to 51:49
This confirms the effectiveness of fairness-aware filtering in removing unintentional demographic skew.
-
-
DISCUSSION
The actual uses and real-time events of utilizing the AI-driven bulk resume filtering system in an HR setting are the main topic of this conversation. By comparing experimental data to similar research, this chapter evaluates the proposed solution's efficacy, usability, and drawbacks. The report also examines the system's hiring problems, such as time efficiency, accuracy, bias avoidance, and human oversight, as well as potential areas for improvement. This discussion demonstrates how the system keeps up with changes in AI recruiting patterns while also considering ethical and practical employment requirements.
Improved screening efficiency.
-
The system significantly reduces the amount of time required for human screening, allowing HR staff to concentrate on other responsibilities.
-
Automated shortlisting and batch uploads allow for the processing of hundreds of resumes in a matter of minutes.
-
According to "AI Powered Resume Screening: Benefits and Challenges," automation has cut the screening task by more than half, which is consistent with this conclusion.
5.2 Enhanced Candidate-Job Matching Accuracy.
-
NLP techniques like cosine similarity and TF- IDF can increase the precision of resume-job matches.
-
Using semantic-level comparison, the system analyzes the different ways in which candidates might present similar qualifications.
-
It can locate relevant profiles using a range of language, unlike simple keyword matching.
The categorization of relevance in a resume.
-
The SVM method classifies items into three groups: "Highly Relevant," "Moderately Relevant," and "Irrelevant."
-
Employers can prioritize candidates more effectively by using layers of filtering instead of binary choices.
-
Based on the multi-tier system described in "Resume Ranking Using Sentence-BERT."
Fair Filtering Capabilitie.
-
"FairTF-IDF" and UMAP are two methods for lowering demographic bias.
-
The model is shielded from any bias based on gender, race, or other controversial issues.
-
The research titled "Mitigating Demographic Bias in AI-Based Resume Filtering" gives this module ethical substance.
Feedback from HR and usability.
-
The HR team reports that bulk uploading, shortlisting, and automated email communication have all yielded positive outcomes. What might be improved?
-
The interface was user-friendly for people with no technical skills because it allowed for filtering, making it more available to those with no prior knowledge of technology.
-
People saw the gadget as a way to increase productivity rather than replace human judgment.
Restrictions Discovered.
-
A few examples of problems are dense writing or resumes with inappropriate formats (like PDFs with images).
-
It might be difficult for NLP models to acquire soft skills, leadership attributes, or contextual accomplishments.
-
When processing resumes that are incomplete or odd, the SVM classifier's accuracy drops.
The Value of Human Supervision.
-
Despite the use of automation, it is still necessary to have human-in-1 validation to guarantee fair and accurate evaluation.
-
Human judgment remains essential for the ultimate assessment of cultural fit, interviews, and employment decisions.
-
Additionally, this backs the research on achieving a balance between automation and human participation.
Possibility of Future Improvement.
-
The system might incorporate complex models, like BERT or GPT, to aid in the analysis of more complex scenarios.
-
Offering industry-specific keyword customization and assistance for resumes written in multiple languages may boost acceptance.
-
Creating a shortlist where HR managers may change the weight of skills and education would enhance personalization.
-
-
CONCLUSION
In this exploration, a complete- scale artificial intelligence capsule screening system is presented to prop HR labor force in automating and perfecting the reclamation process. The system utilizes a range of Natural Language Processing( NLP) ways, including TF- IDF, cosine similarity, and voluntary machine literacy models like Support Vector Machines( SVM), to identify campaigners that are applicable to the job descriptions. This means babe can upload their resumes in bulk, match automatically with campaigners and shoot bus- generated emails real time to the seeker so they do not have to manually work on your behalf.
Taking into account former literature similar as
Resume Ranking using judgment- BERT , AI- Powered Resume Webbing Benefits and Challenge, and the FairUMAP study, our trial shows screening effectiveness, delicacy( and scalability) is significantly bettered. Why? This helped in reducing capsule recycling time by over 70, matching campaigners with jobs more effectively, and giving harmonious results across large data sets. also, the system was largely accurate. The result was exceptional. likewise, the assessments from HR professionals indicated that the system was useful in reducing repetitious work and maintaining good reclamation norms.
In line with new AI- grounded ethics studies, similar as" Fairness in AI Reclamation," this study also examined fairness- apprehensive filtering styles. Through the use of FairTF- IDF and UMAP dimensionality reduction, demographic bias was reduced while fairness in seeker evaluation. By exercising a mortal- in- the- circle system, the system ensures translucency and empowers HR professionals to make final opinions.
While the benefits of this system are apparent, similar as faster processing times, reduced mortal bias, scalability, and better seeker- job matching, it poses challenges related to perfecting dataset quality, interpretability of AI opinions, or ethical oversight. unborn work in resolvable AI, integration with ATS platforms and bodying seeker recommendations are indicated by these challenges.
As a final reflection, this exploration adds to the mounting substantiation that AI has the implicit to transfigure traditional HR approaches.
REFERENCES
-
Blessing, M. (2025). AI-Powered Resume Screening: Benefits and Challenges. ResearchGate.
https://www.researchgate.net/publication/3886 88179_AI-Powered_Resume_Screening_Benefi ts_and_Challenges
-
Mujtaba, D. F., & Mahapatra, N. R. (2024). Fairness in AI-Driven Recruitment: Challenges, Metrics, Methods, and Future Directions. arXiv. https://arxiv.org/abs/2405.19699
-
Kumar, P., & Sharma, A. (2023). AI-Powered Resume Screening and Categorization Using TF-IDF + SVM. International Journal for Innovative Research in Technology (IJIRT), 9(4). https://ijirt.org/publishedpaper/IJIRT176300_P APER.pdf
-
Rodrigues, S., & Finin, T. (2020). Mitigating Demographic Bias in AI-based Resume Filtering: FairTF-IDF and UMAP. UMBC NLP Lab. https://nlp-lab.umbc.edu/wp- content/uploads/sites/240/2020/08/FairUMAP. pdf
-
Nair, A., & Jaiswal, R. (2024). Resume Ranking for a Job Description: Deriving Similarity of
Representational Embedding Using Sentence-BERT. ResearchGate. https://www.researchgate.net/publication/3839 53314_RESUME_RANKING_FOR_A_JOB_DE SCRIPTION_DERIVING_SIMILARITY_OF_R EPRESENTATIONAL_EMBEDDING_USING_ SENTENCE_BERT
