ESCO Skill Extractor

This tool extracts ESCO skills from texts such as job descriptions or CVs. It embeds the input text and the ESCO skills with a transformer model and compares the embeddings using cosine similarity.

Installation

pip install esco-skill-extractor

Usage

from esco_skill_extractor import SkillExtractor

# `device` is optional and defaults to 'cpu'; 'cuda' or another device can be used instead.
# `threshold` is optional and defaults to 0.4; it is the cosine similarity threshold.
skill_extractor = SkillExtractor()

ads = [
    "We are looking for a software engineer with experience in Java and Python.",
    "We are looking for a devops engineer. Containerization tools such as Docker is a must. AWS is a plus."
    # ...
]

print(skill_extractor.get_skills(ads))

# Output:
# [
#     [
#         "http://data.europa.eu/esco/skill/ccd0a1d9-afda-43d9-b901-96344886e14d"
#     ],
#     [
#         "http://data.europa.eu/esco/skill/f0de4973-0a70-4644-8fd4-3a97080476f4",
#         "http://data.europa.eu/esco/skill/ae4f0cc6-e0b9-47f5-bdca-2fc2e6316dce",
#     ],
# ]
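
As the sample output suggests, the result contains one list of ESCO skill URIs per input text, in the same order as the input. A minimal sketch of pairing each ad with its skills could look like the following. Note that `pair_ads_with_skills` is a hypothetical helper written for illustration and is not part of the package; the URIs are copied from the sample output above.

```python
def pair_ads_with_skills(ads, skills_per_ad):
    """Map each input text to the list of skill URIs returned for it.

    Hypothetical helper, not part of esco_skill_extractor.
    """
    if len(ads) != len(skills_per_ad):
        raise ValueError("expected exactly one skill list per ad")
    return dict(zip(ads, skills_per_ad))


ads = [
    "We are looking for a software engineer with experience in Java and Python.",
    "We are looking for a devops engineer. Containerization tools such as Docker is a must. AWS is a plus.",
]

# Stand-in for skill_extractor.get_skills(ads), copied from the sample output.
sample_output = [
    ["http://data.europa.eu/esco/skill/ccd0a1d9-afda-43d9-b901-96344886e14d"],
    [
        "http://data.europa.eu/esco/skill/f0de4973-0a70-4644-8fd4-3a97080476f4",
        "http://data.europa.eu/esco/skill/ae4f0cc6-e0b9-47f5-bdca-2fc2e6316dce",
    ],
]

paired = pair_ads_with_skills(ads, sample_output)
for ad, uris in paired.items():
    print(f"{len(uris)} skill(s) found in: {ad[:40]}...")
```

Keeping the pairing in a dictionary assumes the input texts are unique; for repeated texts, a list of `(ad, uris)` tuples would be the safer choice.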

How it works

1. It creates embeddings from the ESCO skills found on the official ESCO website.
2. It creates embeddings from the input text (one for each sentence).
3. It compares the embeddings of the text with the embeddings of the ESCO skills using cosine similarity.
4. It returns the most similar ESCO skill per sentence if its similarity passes a predefined threshold.
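
The matching step described above can be sketched in plain Python. This is an illustration under stated assumptions, not the package's actual implementation: the three-dimensional vectors are toy stand-ins for transformer embeddings, the `skill/a` and `skill/b` keys are placeholder identifiers, the regex sentence splitter is a naive guess at how text might be split, and treating the threshold as inclusive is also an assumption.

```python
import math
import re


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction, so treat it as dissimilar
    return dot / (norm_a * norm_b)


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def best_skill(sentence_vec, skill_vecs, threshold=0.4):
    """Return the most similar skill id, or None if it misses the threshold."""
    best_id, best_score = None, -1.0
    for skill_id, vec in skill_vecs.items():
        score = cosine_similarity(sentence_vec, vec)
        if score > best_score:
            best_id, best_score = skill_id, score
    return best_id if best_score >= threshold else None


# Toy embeddings (assumption): real embeddings come from a transformer model.
skill_vecs = {"skill/a": [1.0, 0.0, 0.0], "skill/b": [0.0, 1.0, 0.0]}

print(split_sentences("We need Docker. AWS is a plus."))
print(best_skill([0.9, 0.1, 0.0], skill_vecs))  # close to skill/a
print(best_skill([0.0, 0.0, 1.0], skill_vecs))  # orthogonal to both skills
```

Because at most one skill is kept per sentence, a text yields no more skills than it has sentences, and a higher threshold trades recall for precision.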

KonstantinosPetrakis/esco-skill-extractor