Release notes archive
November 25, 2025π︎
Enhanced
- Improved recognition of French driver's licenses
- Fixed an issue with non-ASCII characters in email prefixes
November 20, 2025π︎
Enhanced
- Fixed an issue with non-ASCII characters in email prefixes
November 13, 2025π︎
Enhanced
- Improved normalization of "staff physician" jobs
- Various IT networking skills added to the skills taxonomy
- Various improvements to French profession normalization
November 12, 2025π︎
Enhanced
- Improved recognition of British phone numbers
November 5, 2025π︎
Enhanced
- The TX platform documentation has been updated to provide some additional details about match v2.
September 4, 2025π︎
Enhanced
- Improved Country detection for LLM Parser.
September 1, 2025π︎
Enhanced
- Improved recognition of remote/hybrid jobs
- Improved US state classification: Washington state vs Washington D.C.
August 21, 2025π︎
Enhanced
- When normalizing professions for English resumes, US dialect will now be used as the default output.
August 14, 2025π︎
Enhanced
- Fixed a bug preventing user specified queries and match queries from being combined.
August 4, 2025π︎
Enhanced
- Improved name extraction from German CVs
July 24, 2025π︎
Enhanced
- Query string and query parts can now be specified on match queries.
- Fixed an issue preventing match queries from generating in Match V2.
June 5, 2025π︎
Enhanced
- Improve date extraction for LLM Engine.
May 27, 2025π︎
Enhanced
- Fixed an issue preventing resumes parsed by the LLM Parser from indexing.
May 26, 2025π︎
Fixed
- Fix for Canadian addresses being extracted as french names.
May 6, 2025π︎
Enhanced
- Better location parsing for LinkedIn profiles that have location terms such as 'greater', 'metro', etc.
May 1, 2025π︎
Fixed
- Fix for Resume Formatter where bullets in DOCX templates would get changed to another character.
April 22, 2025π︎
Enhanced
- Get Skills Taxonomy V2 now includes skill definitions.
April 14, 2025π︎
Enhanced
- Improved name parsing in case a middle name is present
March 25, 2025π︎
Enhanced
- Customers now have a way to test changes in a non-production environment for Search & Match V2 on Tx. When an environment is generated a copy is made with the name showing as
ACC. Users can test their changes to the config in the Tx portal on their ACC env and then promote those updates up to Production once they are satisfied. - Auto-Complete languages were not being taken into account when provided as a parameter to the API endpoint. Now they are considered and results are filtered on that language.
- Auto-complete endpoints have been added to Tx documentation and swagger pages.
March 12, 2025π︎
Enhanced
- ONET 2010 is no longer returned when normalizing professions.
February 20, 2025π︎
Enhanced
- HTML conversion now outputs additional classes to specify the type of data that is to be anonymized.
- LLM parser now outputs when found: Drivers License, Date of Birth, Gender, and Nationality.
February 13, 2025π︎
Enhanced
- LLM CV Parser now outputs job descriptions under work history positions.
- Nationality is now normalized to a two-letter country code and output in a new field, if extracted.
- HTML is now returned with
tx-redactedclass on any elements containing known PII.
February 3, 2025π︎
Enhanced
- Improved handling of salary information coming from structured input
January 10, 2025π︎
Enhanced
- Better parsing for LinkedIn URLs when they are split across two lines.
January 7, 2025π︎
Enhanced
- Reduced cost of all Skills Intelligence Ontology API endpoints to 1 credit per API call.
December 16, 2024π︎
Enhanced
- Skills Intelligence APIs now include 'Certifications' skill types. See the documentation for more information.
December 9, 2024π︎
Enhanced
- The parser ignores hidden sample texts from Canva CVs
Fixed
- Fixed issue concerning phone numbers being parsed from URLs
November 11, 2024π︎
Enhanced
- English: 10-20% faster parsing, even faster for longer CVs
October 14, 2024π︎
Enhanced
- PDFs: improved language guessing (#24320)
- Improve detection of Canadian addresses (#25451)
September 11, 2024π︎
Fixed
- Fixed issue when adding logos to resume formatter templates
September 9, 2024π︎
Enhanced
- Significantly improved search times.
- Fixed an issue causing education and work experience items to parse incorrectly on LinkedIn CVs when the document was written in English but the section headers were in Chinese.
Added
- Added Time Scale to job parsing salary output.
August 7, 2024π︎
Enhanced
- Improve handling of addresses from Puerto Rico (#23241)
- Improvements to country detection (#21406, #22904)
August 6, 2024π︎
Enhanced
- Improved keyword extraction in Match
July 31, 2024π︎
Enhanced
- LLM Parsing is now available in all languages supported by the Resume Parser. Language-level settings are available by reaching out to your account manager orsales@textkernel.com.
- LLM Parsing credit cost reduced from 1 to 0.1 additional credits.
July 18, 2024π︎
Enhanced
- Word documents that only contain scanned images now get sent to OCR.
Fixed
- Adding/Updating user-defined tags to indexed CVs and Jobs is now idempotent. Documents can no longer have duplicate tags.
- Parse location correctly in LinkedIn CVs when the city is in New York state.
July 8, 2024π︎
Enhanced
- Improvements to country detection (Zendesk #22394)
June 27, 2024π︎
Added
- Education StartDate is now output when using the LLM Parser.
June 24, 2024π︎
Enhanced
- Ignore emails address of certain CV generators
June 13, 2024π︎
Enhanced
- The LLM Parser, Job Description API, and Education Normalization are now available in the AU Data Center.
June 11, 2024π︎
Enhanced
- Improved parsing of LinkedIn HTML input
May 30, 2024π︎
Enhanced
- Improve serialization of Flex Request responses from the LLM.
May 24, 2024π︎
Enhanced
- Fix issue deserializing education parsing response from OpenAI in the LLM Parser.
May 22, 2024π︎
Enhanced
- Japanese: significant improvement to many parsing fields
May 10, 2024π︎
Enhanced
- Increased upper limit on /v10/ontology/suggest-skills-from-professions endpoint from 10 to 50.
May 8, 2024π︎
Enhanced
- LLM Parser and FlexRequests now support a higher throughput.
May 1, 2024π︎
Added
- Resume Formatter now supports custom templates.
Enhanced
- Unsupported HTTP methods return a proper 405 - Method Not Allowed status code.
April 17, 2024π︎
Enhanced
- Get All Sub-Accounts endpoint now returns whether or not an account is Unrestricted and the CreditsRemaining if not.
- Matching & Searching with NormalizeJobTitles turned on now only charges credits when there are job titles to be normalized in the FilterCriteria
Fixed
- Fixed education normalization when country is UK or when education item has more than 1000 characters.
April 16, 2024π︎
Enhanced
- Ignore emails address of certain CV generators
April 10, 2024π︎
Fixed
- Fix some very rare cases where a document would take too long to parse.
April 9, 2024π︎
Enhanced
- OCR Stability Improvements
- Improved performance when parsing with OutputPdf set to true and sending a document that is already a PDF.
March 28, 2024π︎
Added
- Added endpoints to retrieve Sub-Account details for easier account management.
Fixed
- Fixed an issue where RelatedProfessionClasses were always returned in English when parsed with the Deep Learning engine.
March 20, 2024π︎
Enhanced
- Improved candidate name parsing for LinkedIn PDFs.
Fixed
- Fixed an issue where first and last names would be swapped when the CV uses combined casing, such as 'FIRSTNAME Lastname'.
Removed
- No longer support parsing ZIP files or other compression file formats.
March 18, 2024π︎
Enhanced
- Fixed issues with parsing of LinkedIn Public and Private HTML profiles
March 13, 2024π︎
Added
- The Resume Parser and Job Parser now support parsing images via OCR. You can enable OCR from the Developer Center in the Tx Console. All parses will incur a small additional credit cost.
Enhanced
- Customers using the self-hosted version of the software can now download the software as containers. Please review the Installation Guide for instructions and more details.
March 6, 2024π︎
Enhanced
- Significantly improved multi-column and PDF text conversion in the AU Data Center resulting in general parsing improvements across the board.
March 5, 2024π︎
Fixed
- Fixed an issue that prevented viewing invoices
- Report correct MonthsOfWorkExperience when CV has overlapping date ranges for jobs.
- Fixed error that would occur when searching and matching on more than 3 job titles in a Semantic Expression when Job Title Normalization is enabled.
- No longer identify items in slash-delimited lists as state names. For example, "BC/BS" or "AR/AP".
Added
- Added much-improved degree normalization (local and international) and deprecated old degree normalization fields. New fields are Degree.NormalizedLocal and Degree.NormalizedInternational, both with Code and Description.
- New Job Description API - leverage LLMs to generate full job descriptions based on criteria like job title, skills, and location
Enhanced
- Improved email address parsing when the conversion from pdf to text contains additional whitespace characters.
- Added support for LLM Parser and FlexRequests to SDKs.
- 22% error reduction in school name parsing for US colleges/universities.
- Linkedin PDF: Better handling of names with many titles
Deprecated
- Deprecated Degree.Name.Normalized and Degree.Type. Use Degree.NormalizedLocal or Degree.NormalizedInternational now.
March 4, 2024π︎
Enhanced
- Job Descriptions API supports German, French and Dutch
February 23, 2024π︎
Enhanced
- All languages: Improve salary reasoning when specific letter casing is used
February 5, 2024π︎
Enhanced
- Improved salary parsing in Dutch job descriptions
January 23, 2024π︎
Enhanced
- Improved location parsing in cases where the city is also a major international city, such as Alexandria, Virginia, and Alexandria, Egypt.
- Matching UI (deprecated): Expanded the list of countries available in the location filter.
- Improved name parsing in rare cases where the name has no spaces (formatting issue) or only a single name is provided.
Fixed
- Self-Hosted Only: Fixed bug in v10 API where Tx-AccountId and Tx-ServiceKey were not accepted headers.
January 9, 2024π︎
Enhanced
- Added many new V2 skills and updated some mappings to related professions.
- Do not report "pincode" as a name or part of the street address (denotes a postal code in India locales).
- Strip leading bullets/asterisk/other special characters when parsing email addresses.
Fixed
- Fixed issue causing UserDefinedTags and CustomValueIds not being used appropriately when Searching via the Matching UI.
- Fixed issue causing a response with a 500 status code when not including a Limit in the Suggest Skills from Profession, Suggest Skills From Skills, Suggest Professions, Autocomplete Skills, and Autocomplete Professions endpoints.
Version 9.16.7π︎
December 4, 2023
Changesπ︎
Preparing for rebrand to 'Tx Platform'. Users do not need to make any changes. No disruption in service is expected.
Version 9.16.6π︎
November 30, 2023
Changesπ︎
Preparing for rebrand to 'Tx Platform'. Users do not need to make any changes. No disruption in service is expected.
Version 9.16.4π︎
November 16, 2023
New Featuresπ︎
CV/Resume Parser FlexRequests (Beta)π︎
Powered by our LLM Parsing Engine, all CV/Resume Parser requests can now include FlexRequests. FlexRequests allow you to ask any question about a resume. More details can be found here.
Improvementsπ︎
CV/Resume & Job Parserπ︎
For Skills V1, removed dangerous two-letter skills such as AP (Accounts Payable or Andhra Pradesh) and AR (Accounts Receivable or Arkansas).
Do not output a language competency when the CV contains things like serviced Dutch clients, etc.
No longer output country name as part of the work history description in certain cases.
Recognize National Senior Certificate as an educational degree.
Version 9.16.3π︎
October 26, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed bug that would cause a 500 status code to be returned when parsing some Hebrew, Japanese, and Turkish documents.
Version 9.16.2π︎
October 25, 2023
Improvementsπ︎
CV/Resume & Job Parserπ︎
Increased maximum file size limit to 16MB.
LLM Parsing Engine (Beta)π︎
Greatly improved the Professional Summary using the power of our LLM Parsing Engine.
Now giving more clear error message returned alongside 429 status codes.
Version 9.16.1π︎
October 12, 2023
Improvementsπ︎
LLM Parsing Engine (Beta)π︎
Increased amount of skills found during parsing.
Bug Fixesπ︎
LLM Parsing Engine (Beta)π︎
Fixed issue causing duplicate Profession classes to be output.
Fixed issue not correctly calculating total months experience and last used date for skills.
Version 9.16.0π︎
October 2, 2023
New Featuresπ︎
CV/Resume Parserπ︎
The LLM Parsing Engine is now available as a beta feature. It can be enabled by setting the UseLLMParser property on resume parsing requests. More details can be found here.
Version 9.15.3π︎
September 7, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Added many new V2 Skills.
Version 9.15.2π︎
August 25, 2023
Bug Fixesπ︎
All Componentsπ︎
Stability improvements.
Version 9.15.1π︎
August 24, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Improved parsing for rare edge cases in LinkedIn profiles, such as when a company name and dates are split across pages.
Version 9.15.0π︎
August 15, 2023
Improvementsπ︎
Search & Matchπ︎
Search now supports results past 1000 documents. For more information, reach out to sales@textkernel.com.
Improved ranking of search results when using skill:() or title:() clauses in the SearchExpression field.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed an issue with Portuguese LinkedIn PDF work history parsing.
Version 9.14.8π︎
August 1, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Skills normalization is now supported for Turkish CVs.
Bug Fixesπ︎
CV/Resume Parserπ︎
In V9, fixed skills output for Hebrew, Japanese, and Turkish CVs when the skill was not found in a particular section of the CV.
Corrected the Profession Group IDs to match the values used in the Skills Intelligence APIs.
Job Parserπ︎
Corrected the Profession Group IDs to match the values used in the Skills Intelligence APIs.
Version 9.14.7π︎
July 19, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Improved certification parsing when the description includes the word 'Certification'.
Better city/state/country parsing when no postal code or street address is included, especially for locations in US and India.
Now recognize VTU, VIT, and ISB as universities.
Other minor improvements for education and location parsing in India.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed issue in LinkedIn PDFs where some descriptions of time periods could be mistaken for a work history entry.
Version 9.14.6π︎
June 30, 2023
Bug Fixesπ︎
All Componentsπ︎
Stability improvements.
Version 9.14.5π︎
June 21, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Fixed bug when parsing education on Hebrew, Japanese, and Turkish documents.
Version 9.14.3π︎
June 7, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed bug where Employer Locations were not being output on Hebrew, Japanese, and Turkish documents.
Minor fix for LinkedIn profiles with very long company names.
Version 9.14.2π︎
May 25, 2023
Bug Fixesπ︎
Job Parsingπ︎
Fixed bug where sending images caused an unhandled exception. Now the API response will be 422 - Unprocessable Entity.
Version 9.14.1π︎
May 17, 2023
New Featuresπ︎
Download Indexed Documentsπ︎
You can now download a list of all documents in your indexes via the Tx Console. Review the Standard Transaction Costs for pricing. The download includes 2 columns - the document id and the document's revision date.
Bug Fixesπ︎
Job Parsingπ︎
Fixed bug where the Required field was always output as false when using V2 Skills.
Version 9.14.0π︎
May 11, 2023
Improvementsπ︎
Job Parsing Outputπ︎
Significantly improved job parsing accuracy and greatly reduced error rates for almost all output fields.
Added many new fields to the parsing output. Some languages do not include these fields in the output. See this document for additional details.
- Job Description
- Job Requirements
- Benefits
- Employer Description
- Start Date
- Salary
- Minimum Working Hours
- Maximum Working Hours
- Working Hours
- Is Remote
- Drivers License Requirements
- Local Education Level
- Employment Type
- Contract Type
- Application Description
- Contact Person
- Contact Phone
- Contact Email
- Website
- Application Deadline
- Posted Date
- Reference Number
Version 9.13.5π︎
May 4, 2023
Improvementsπ︎
CV/Resume & Job Parserπ︎
Added many new V2 Skills.
V2 Skills are now available in the following additional languages: Croatian, Czech, Greek, Hungarian, Romanian, Russian, Slovak, Slovenian, and Turkish.
Version 9.13.4π︎
April 14, 2023
Improvementsπ︎
Search & Matchπ︎
Minor stability improvements during indexing.
Version 9.13.3π︎
April 13, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Minor stability improvements.
Version 9.13.2π︎
March 30, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Major improvements to candidate name parsing, especially in cases where the name appears in an email address or LinkedIn URL.
Polish LinkedIn CVs now parse correctly.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed an issue where sometimes GPA would be picked up as a date in the education section.
Some minor runtime bug fixes and rare timeout fixes.
Search & Matchπ︎
Fixed the 'check if document exists' endpoint to return 404 - Not Found if the document does not exist.
Version 9.13.1π︎
March 9, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Decreased time to timeout parsing on Turkish, Hebrew, and Japanese documents. A timeout in these languages will return a 408 - Request Timeout response.
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Search & Matchπ︎
Fixed bug giving 500 when trying to index a resume without metadata. A 400 status code will now be returned instead.
Version 9.13.0π︎
March 2, 2023
Transaction Cost Updatesπ︎
This release introduces an improvement in the transaction cost structure for accounts with Search & Match enabled. Following this change, you will be able to parse more documents using the same number of credits, when you choose not to index all of the parsed documents.
The transaction cost structure will be changed as follows:
| Action | Old base transaction cost | New base transaction cost |
|---|---|---|
| Parse resume or job | 2 credits when matching is enabled, else 1 | 1 credit |
| Index or update a full resume or job | 0 credits | 1 credit |
| Update tags for an already indexed document | 0 credits | 0 credits |
Please note that the base transaction cost does not cover additional services such as geocoding and normalization of professions and skills. The cost for these add-ons will remain unchanged. You can see full details regarding transaction costs here.
Improvementsπ︎
Search & Matchπ︎
Documents parsed without Search & Match enabled can now later be indexed and used for matching/searching.
CV/Resume Parserπ︎
Improved name parsing on English resumes from Indian locales.
Bug Fixesπ︎
Search & Matchπ︎
Fixed bug sometimes outputting NaN for SuggestedCategoryWeights on bimetric scoring responses.
Version 9.12.3π︎
February 16, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
For Portuguese resumes, correctly output Portuguese as the 'mother tongue' instead of Spanish.
Hebrew, Japanese and Turkish resumes now correctly output the FoundMonth and FoundDay fields on Dates.
Fixed various issues where parsing didn't properly timeout.
Other minor runtime bug fixes.
Version 9.12.1π︎
February 1, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
Stability improvements.
Version 9.12.0π︎
January 26, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Version 9.11.4π︎
January 18, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Improved accuracy for parsing names/locations when the language is English and the locale is India.
Bug Fixesπ︎
CV/Resume Parserπ︎
Output correct type for certain web addresses, such as GitHub.
For Swedish resumes, do not output Danish as the 'mother tongue'.
Fixed some rare runtime bugs in the parser.
Job Parserπ︎
Do not consider languages found in certain sections of a job order as 'required'.
Search & Matchπ︎
Fixed an issue where specifying geocoordinates during job parsing caused an error.
Handle certain whitespace in index names instead of causing an error.
Resumes with a very large number of skills no longer cause an error during matching.
Version 9.11.2π︎
December 15, 2022
Bug Fixesπ︎
Document Converterπ︎
Fixed an issue where text files would not generate HTML when OutputHtml was set to true.
Version 9.11.1π︎
December 14, 2022
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed regression with LinkedIn Profile PDF parsing that was introduced in v9.11.0.
Version 9.11.0π︎
December 12, 2022
New Featuresπ︎
CV/Resume Parserπ︎
Japanese and Turkish are now supported languages. See this document for additional details.
CV/Resume & Job Parserπ︎
Significantly improved multi-column and PDF text conversion resulting in general parsing improvements across the board.
Added version selection to professions normalization for taxonomies that offer multiple versions. ONET "2010" is deprecated. Use "2019" instead. See ONET Version for more details.
Improvementsπ︎
CV/Resume Parserπ︎
Major improvements for Hebrew parsing.Low usage fields are no longer output for this language.
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Fixed various issues where parsing didn't properly timeout.
Version 9.10.4π︎
November 28, 2022
Improvementsπ︎
CV/Resume Parserπ︎
Improved recognition of university names.
Report Email-myname@company.com as myname@company.com instead of Email-myname@company.com and related scenarios.
Version 9.10.3π︎
November 22, 2022
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed runtime parsing error.
Fixed various scenarios that rarely caused requests to take more than 25s.
Version 9.10.1π︎
November 10, 2022
Bug Fixesπ︎
Search & Matchπ︎
Fixed error where unrelated job titles were being considered a variation of each other. This was only an issue in the matching logic and had no impact on parsing and doesn't require any reindexing.
Version 9.10.0π︎
November 03, 2022
Improvementsπ︎
CV/Resume Parserπ︎
Improve name detection when split across multiple lines
Added support for NIH Stroke Scale certification
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed detection of employment history when denoted by Current Employer header.
Reduce reporting of false positive company names when a company isn't specified.
Reduce false positive certifications from being reported.
Minor runtime bug fixes.
Version 9.9.1π︎
October 19, 2022
Bug Fixesπ︎
Job Parserπ︎
Fixed bug ouputting V2 Skills in Version 10.
SOAP (deprecated)π︎
Fixed an issue that would return an invalid URL when generating a new service proxy.
Fixed bug that would return errors when using SOAP version 1.2.
Version 9.9.0π︎
October 11, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Speed improvements.
Added many new V2 Skills.
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Version 9.8.1π︎
August 3, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better LinkedIn URL parsing when URL is split across multiple lines.
Improved parsing of Danish LinkedIn PDFs.
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Fixed redaction of candidate first name from candidate summary when using V2 skills.
Search & Matchπ︎
Fixed error that would occur when filtering by Job Titles and including a null Settings parameter in the request.
Version 9.8.0π︎
July 18, 2022
Enhancementsπ︎
CV/Resume & Job Parserπ︎
Major upgrade of the Skills Taxonomy (V2)π︎
-
Upgrade for freeto Skills V2. Benefits of V2 include:
- 2x larger skills taxonomy, updated frequently based on real-world data.
- 15-40% higher accuracy of extracted skills.
- Ability to use Skills Normalization (see below).
- When using V2, the candidate summary will no longer contain skills information unless Skills Normalization is also enabled (see below)
Enable Skills Normalization to benefit from better skills classificationπ︎
- New! Skill types are output (IT/Professional/Soft Skills)
- Enhances the candidate summary with structured skills information
- Better clustering of skill synonyms
- Compatibility with the taxonomy used in Textkernel's Skills Intelligence APIs and Jobfeed, enabling standardization of taxonomies across all of your data and benchmarking against jobs posted online.
- Consumes 0.1 additional credits per transaction
New Featuresπ︎
CV/Resume & Job Parserπ︎
Normalization of Job Titlesπ︎
Enable Professions Normalization to standardize job titles according to a taxonomy.
- For Resumes, the most recent 3 job titles are normalized. View the new output properties here.
- Supported Taxonomies: Textkernel, ISCO 2008, O*NET 2010. ONET 2019 will be added soon.
- The professions taxonomy and the mappings are compatible with the taxonomies used in Textkernel's Skills Intelligence APIs and Jobfeed, enabling standardization of taxonomies across all of your data and benchmarking against jobs posted online.
- Supported languages: English, German, French, Spanish, Italian, Dutch, Japanese, Simplified Chinese, Portuguese, Polish
- Consumes 0.2 additional credits per transaction
Search & Matchπ︎
Normalization of Job Titlesπ︎
Get more search and match results by extending the query beyond exact job titles.
-
Normalized job titles are automatically included in Match queries
- Requires the candidates and jobs to be parsed and indexed with Professions Normalization enabled
-
Manual search queries can be normalized in real-time through the API (see here)
-
Consumes 0.2 additional credits per transaction
Version 9.7.5π︎
May 4, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Added some new IT skills.
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Fixed a few minor runtime errors.
Search & Matchπ︎
Better handling of invalid Index and Document IDs.
Version 9.7.4π︎
April 6, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Added many new IT skills.
Removed some job titles from the skills data. Users should use the Configuration parameter to output job titles as skills.
Small improvement to Awards section header recognition.
Bug Fixesπ︎
Search & Matchπ︎
Fixed SearchExpression validation that caused occasional search expressions to return a bad request. This occured when the search expression was only a preposition and used the title:() syntax.
Version 9.7.3π︎
March 23, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better detection/cleanup of HTML-encoded entities in text that is only partially HTML.
Small improvement to Certifications section header recognition.
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Fixed a problem where some LinkedIn profiles would be truncated on certain words/phrases.
Matching UIπ︎
Do not require the user to login if the email address (Username) contained uppercase characters.
Version 9.7.2π︎
February 10, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Improved municipality and region parsing in all languages/locales.
Better date alignment, and therefore work history parsing, for some LinkedIn profiles with 2+ positions at one employer.
Improved de-columnization for LinkedIn profiles with only 1 position listed. This greatly improves parsing for these profiles.
Resume Formatterπ︎
Skill tags are now a neutral color.
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Don't allow Japanese punctuation in an English resume to cause the language to be reported as Japanese.
[v10 only] Fixed a mapping issue that was causing a DateOfBirth of 0001-01-01 to be output when no birth date was found.
Eliminated rare cases where setting OutputHtml = true was causing the parsing request to hang.
Resume Formatterπ︎
No longer output company email/phone/etc if those are not provided in the API request.
Fixed problem where not setting a footer caused the output document to be malformed.
Version 9.7.1π︎
December 3, 2021
New Featuresπ︎
CV/Resume & Job Parserπ︎
Hebrew is now a supported language, and Israel is a supported locale.
Version 9.7.0π︎
November 18, 2021
Improvementsπ︎
Search & Matchπ︎
/searcher endpoint now returns results sorted by a relevancy score instead of simply revision date.Learn more.
Regions in Australia are now normalized during indexing and querying.
Bug Fixesπ︎
Search & Matchπ︎
Fixed indexing documents where address contains multiple regions.
When an address is specified during geocoding, replace the address on the parsed document so the new address can be indexed along with the latitude and longitude. This can then be used for exact location filters.
Version 9.6.9π︎
November 5, 2021
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better LinkedIn parsing in multiple languages.
Eliminated some minor runtime bugs.
Improved location parsing.
Better Hungarian education parsing.
Version 9.6.8π︎
September 22, 2021
Improvementsπ︎
CV/Resume & Job Parserπ︎
Internal query and index optimizations.
Improved parsing of social media handles and URLs and monikers.
Better inference of Swedish date of birth from national identity number.
Improved robustness of PII scrubbing.
Improved LinkedIn parsing.
Reporting more accurate certifications, and fewer certification variants.
Improved Work History parsing for all languages, including date parsing.
Improved Russian language location parsing, sectioning, and Work History parsing.
Version 9.6.7π︎
August 9, 2021
Improvementsπ︎
Search & Matchπ︎
Internal query and index optimizations.
Document Converterπ︎
Improved image extraction/processing for large images.
Fixed timeouts in some documents.
Version 9.6.6π︎
August 3, 2021
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Fixed several runtime bugs.
Version 9.6.5π︎
July 27, 2021
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better parsing for personal information.
Improved sectioning of resumes, and identification of more education sections.
Better reordering of multi-column resumes.
Better parsing of LinkedIn resumes.
More accurate work history parsing, primarily in English, Chinese, German, Dutch, and Italian.
More accurate contact information parsing, especially for India and Czech and Norwegian languages.
More accurate education parsing, especially in French and Dutch.
Improved parsing for Romanian language.
Added more known certifications and personal qualifications.
Document Converterπ︎
Better detection/extraction for candidate images.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed runtime bug related to anonymizing certain resumes.
Version 9.6.3π︎
June 30, 2021
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better location parsing for Pakistan.
Improved LinkedIn parsing to support the latest LinkedIn PDF changes.
Recognize the following online education websites:
- Udemy
- Coursera
- Pluralsight
- Udacity
- CodeAcademy
- Skillshare
- edX
- DataCamp
- Treehouse
Better error handling.
Now we return an HTTP Status Code 413 - Payload Too Large for resumes or jobs that are too large to be parsed. Previously, these files would return 422 - Unprocessable Entity. The file size limit is ~6MB, which becomes ~8MB after Base64-encoding has been done. Files larger than this are typically not resumes/jobs, or are scanned images of resumes/jobs which Textkernel does not support. Scanned images need to be OCR'ed prior to sending to Textkernel to parse.
Version 9.6.2π︎
June 25, 2021
Improvementsπ︎
Document Converterπ︎
Better candidate image recognition to eliminate many false-positives.
CV/Resume & Job Parserπ︎
Greatly enhanced support for traditional Chinese language.
Improved candidate name and qualification suffixes parsing.
LinkedIn parsing enhancements.
Better Australian locale parsing.
Improved school parsing for Netherlands and Australia.
Better parsing for Canada, Portugal, China, Hungary, and Czech Republic.
Improved Contact Info, Work History, and Education parsing across all languages.
Better parsing for US addresses with no street address or postal code.
Added support for the following social media usernames/handles in Contact Info:
- GitHub
- Skype
- Stack Overflow
- Signal
- Quora
- ICQ
- Telegraph
- Telegram
- MeWe
- Parler
- Gab
Simplified and improved candidate summaries across all languages.
Improved support for multi-column resumes.
Improved Job Level categorization in position history output.
Bug Fixesπ︎
Document Converterπ︎
Replaced functionality in third-party converter that was causing a memory leak when processing some PDFs when OutputCandidateImage was set to true.
Sourcingπ︎
Fixed a runtime bug when purchasing resumes through CareerBuilder.
Version 9.6.1π︎
March 25, 2021
Improvementsπ︎
CV/Resume & Job Parserπ︎
Much better parsing for two-column resumes, including resumes that are only partially two-column.
Improved detection/parsing of date ranges that are split into a vertical multi-line section within an otherwise horizontal text section.
Better contact info parsing. We now report the best-formatted phone number when multiple versions of the same number are present in the resume. We no longer report fax numbers because it is 2021, not 1991. We no longer report work phone numbers when there are several other non-work phone numbers.
Improved recognition of stacked person names (names broken into words on separate lines).
We no longer report 'Politics', which was only active for Chinese resumes because we feel it is immoral.
Better education parsing. This is the most significant advance in education parsing in several years. Fewer false cognate school entries, better degree names, and much better major names. Also all certificates (except in certain locales such as Australia) have been moved to the Certifications section since they are really not the same as degreed education.
Far better recognition/handling of job nesting (or not) by company name and dates.
Improved accuracy on all aspects of employment parsing: company name, job title, location, and dates.
Improved parsing for LinkedIn resumes, especially for some non-English languages.
Improved parsing for Indeed's new resume format, which include an Assessments section.
Skills now generally ouput as Proper-case rather than UPPERCASE.
Improved accuracy across all data types in all languages, but especially so in English, German, Italian, Dutch, French, Russian, Hungarian, Czech, and Swedish.
Document Converterπ︎
Improved PDF Conversion.
Upgraded 3rd party software.
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Fixed various reported bugs.
Document Converterπ︎
Fixed a bug that was causing some blank HTML to be output on some corrupted/no-text documents
Version 9.5.2π︎
December 15, 2020
Improvementsπ︎
Document Converterπ︎
Improved HTML Conversion.
Bug Fixesπ︎
Search & Matchπ︎
Minor runtime bug fixes.
Version 9.5.1π︎
October 9, 2020
New Featuresπ︎
CV/Resume & Job Parserπ︎
Added initial support for Indonesian culture.
Improvementsπ︎
CV/Resume Parserπ︎
More accurate Contact Info, Education and Employment parsing.
Speed improvements.
Minor bug fixes.
Document Converterπ︎
Improvements to selection of correct conversion strategy.
Version 9.5π︎
September 3, 2020
New Featuresπ︎
Search & Matchπ︎
Scoring and Default Sortingπ︎
We have developed a new score (SovScore) that is derived from the combination of the WeightedScore and the Reverse Compatibility Score (RCS). Blending these two scores together using a proprietary algorithm allows us to provide a single score that blends results from both directional perspectives of the match. This new SovScore results in better matches rising to the top of the result set, while matches that score highly only from a single directional perspective will be scored lower.
All match results returned by the API, and in Textkernelβs own Search & Match UI, are sorted in descending order by the SovScore property. Please note that you are always free to use your own custom sorting or postprocessing of the returned results before displaying them in your own UI.
Transparencyπ︎
In an effort to continually increase the industry-leading transparency of our Search & Match engine, we have added new metadata to the match response. We added an EnrichedData property for each direction of the match. This object contains (a) the terms that were found, (b) the terms that were not found, and (c) explanatory messages (in English) about each data point. For example, in the skills object you might see that the skill 'java' was found, and in the corresponding explanation it might state that while 'java' was found in the document as a skill, it wasn't used by the candidate recently.
Improvementsπ︎
CV/Resume & Job Parserπ︎
Improved coverage for certain CAD/CAM skills.
Version 9.4.10π︎
August 26, 2020
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better nesting of jobs based on employer names.
Better phone number parsing (eliminated some false cognates).
Better work history parsing in Dutch, Russian, Italian, French, and Greek.
Improved coverage of the cloud computing sub taxonomy.
Document Converterπ︎
Can convert more documents to HTML, whereas some conversions previously failed.
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Version 9.4.9π︎
August 6, 2020
Improvementsπ︎
CV/Resume & Job Parserπ︎
Improved rejection of false positive addresses and phone numbers.
Substantial improvements to skills taxonomy weights (expressed in output as percentages).
Better parsing for Dutch, Hungarian, and Croation languages.
Better accuracy in all sections.
9% faster.
Document Converterπ︎
Better conversion to HTML and PDF.
Some files that could not be converted to HTML or PDF because of internal errors in the document can now be converted to HTML or PDF.
Version 9.4.8π︎
June 23, 2020
Improvementsπ︎
CV/Resume & Job Parserπ︎
Remove some spurious HTML tags in plain text before parsing.
Better certifications parsing.
Better location parsing in all languages.
More accurate education parsing in all languages.
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Version 9.4.7π︎
May 29, 2020
Bug Fixesπ︎
Job Parserπ︎
Fixed a language identification bug when parsing non-English jobs.
Version 9.4.6π︎
May 22, 2020
Bug Fixesπ︎
Document Converterπ︎
Fixed a rare HTML conversion issue.
Search & Matchπ︎
Fixed various request validation bugs for malformed requests.
Version 9.4.5π︎
May 13, 2020
Improvementsπ︎
CV/Resume Parserπ︎
Improved LinkedIn profile parsing.
Version 9.4.3π︎
May 1, 2020
Improvementsπ︎
CV/Resume Parserπ︎
Better Hungarian parsing.
Better parsing of certifications.
Better parsing of educational majors.
Fewer false positives on educational GPAs.
Generally more accurate.
Fixed some minor runtime bugs.
Search & Matchπ︎
Improved the handling of punctuation in full-text searching.
Bug Fixesπ︎
Search & Matchπ︎
Fixed edge case for handling bad input to CategoryWeights.
Version 9.4.2π︎
March 17, 2020
Improvementsπ︎
CV/Resume Parserπ︎
Data improvements.
Document Converterπ︎
Improved conversion of PDF to HTML.
More accurate detection of LinkedIn PDFs.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed two runtime bugs.
Version 9.4.1π︎
March 11, 2020
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed several runtime bugs.
Job Parserπ︎
Additional runtime bug fixes.
Version 9.4.0π︎
March 4, 2020
Improvementsπ︎
CV/Resume Parserπ︎
Better LinkedIn parsing.
World's fastest resume parsing is now substantially faster.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed several runtime bugs.
Version 9.3.10π︎
February 14, 2020
Improvementsπ︎
CV/Resume Parserπ︎
Better LinkedIn parsing.
Better email parsing.
Better Chinese parsing.
Improved job title parsing for Bulgarian, Estonian, Finnish, Croatian, Hungarian, Lithuanian, Latvian, Polish, Romanian, Slovak, and Slovenian.
Improved education parsing for Indonesian school names.
Improved handling of company names which are actually URLs.
Improved handling of invalid dates such as February 29, 2007 (2007 was not a leap year).
Added a new configuration option to output all Metadata in English regardless of the resume language. Set "OutputFormat.AllSummariesInEnglish = true;" in the configuration string.
Job Parserπ︎
Improved parsing for years of experience requirements.
Document Converterπ︎
Better PDF Conversions.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed several runtime bugs.
Fixed bug that caused some jobs to be duplicated when reported on some German CVs.
Version 9.3.9π︎
November 21, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Better Work History parsing.
Improved LinkedIn profile parsing.
Document Converterπ︎
Performance optimizations.
Version 9.3.8π︎
November 19, 2019
Improvementsπ︎
SaaS Servicesπ︎
System performance optimization.
Version 9.3.7π︎
November 8, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Better Swedish work history parsing.
Better parsing for Educational History in all languages.
Improved skills taxonomy.
Improved LinkedIn profile parsing for location and street level address information.
Improved LinkedIn profile URL parsing.
Improved Candidate Name suffix parsing.
Improved recognition of some obscure date formats, primarily for European resumes in a language other than English.
Improved Chinese language parsing accuracy.
Improved accuracy of school names and degrees.
Document Converterπ︎
Improved performance for converting documents to HTML.
Reduced timeouts on Excel documents.
Improved LinkedIn profile text conversion for the latest version of LinkedIn profiles.
SaaS Servicesπ︎
Improved performance for all SaaS Service endpoints.
Version 9.3.6π︎
September 30, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Improved LinkedIn profile parsing.
Added new skills.
Improved Employment parsing for Swedish resumes.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed a runtime bug.
Version 9.3.5π︎
September 26, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Added parsing support for UK academic qualifications such as GCSE, BTEC, NVQ, and DipCG.
Search & Matchπ︎
Enhanced the robustness of the matching engine.
Added support for non-integer taxonomy ids for Match by Criteria requests.
Version 9.3.4π︎
September 12, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Better logic for two-character skills to prevent reporting erroneous data as a skill.
More accurate Company Name parsing.
Better Italian Employment History parsing.
More accurate education parsing.
Up to 20x faster on some resumes, and about 9% faster overall.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed several runtime bugs that were tested to occur about once every million documents.
Version 9.3.3π︎
August 28, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Improved parsing of LinkedIn profiles in Spanish.
Improved the Experience Summary output when no dominant taxonomy was found.
Improved phone number parsing.
Document Converterπ︎
Improved detection and conversion of reversed text in poorly constructed PDF documents.
Search & Matchπ︎
Diacritics (e.g. Γ©) can now be used in index names and document IDs.
Version 9.3.2π︎
August 14, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Major improvements to Employment parsing by changing the logic used for nesting of jobs by date ranges. The logic was improved all around, meaning that more jobs that should be nested do get nested, and more jobs that should not be nested don't get nested.
More accurate LinkedIn parsing.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed several runtime bugs that were tested to occur about once every million documents.
Document Converterπ︎
Fixed a condition that would report a file as corrupt rather than having too-short-to-be-believed lines.
Version 9.3π︎
July 29, 2019
New Featuresπ︎
CV/Resume Parserπ︎
Added full support for the following languages/locales:
- Bulgarian
- Croatian
- Estonian
- Finnish
- Hungarian (was previously partially supported)
- Latvian
- Lithuanian
- Polish
- Romanian
- Slovak
- Slovenian
Job Parserπ︎
Added full support for the following languages/locales:
- Bulgarian
- Croatian
- Estonian
- Finnish
- Hungarian (was previously partially supported)
- Latvian
- Lithuanian
- Polish
- Romanian
- Slovak
- Slovenian
Search & Matchπ︎
Added full support for the following languages/locales:
- Bulgarian
- Croatian
- Estonian
- Finnish
- Hungarian (was previously partially supported)
- Latvian
- Lithuanian
- Polish
- Romanian
- Slovak
- Slovenian
Bug Fixesπ︎
Search & Matchπ︎
Fixed a bug where setting MatchCriteria.MonthsManagementExperience = null would cause an error.
Version 9.2π︎
July 18, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Accuracy improvements in all languages.
More accurate contact info parsing. Better phone number type classification.
More accurate LinkedIn parsing.
More accurate Employment parsing.
More accurate Skills parsing with new terms.
More accurate Education parsing. Far more accurate degree types in every language.
Added support for Resume Quality codes 121-124 for resumes from UK and NZ
Better parsing of Colombian national identity numbers.
Approximately 3%-14% faster than previous release.
Job Parserπ︎
Parsing throughput has increased over 50%.
Document Converterπ︎
Better detection and correction of reversed text.
More useful conversion of LinkedIn pdf profiles in the varied two-column formats.
Faster conversion of PDF to HTML.
Better trimming of leading whitespace preceding first non-whitespace character in converted text.
When documents take too long to convert, we now return converted text in many more instances, whereas before we returned no text.
Improved messaging for possible conversion errors.
New Output Validity: ovLinesSeemTooShort.
Added messages for output validity warnings and errors.
Upgraded third party DLL versions:
- Aspose Words 19.4
- Aspose PDF 19.4
- dtSearch 7.93.8596.18093
Search & Matchπ︎
We significantly improved normalization of school names behind the scenes, so that searches on school names will return more comprehensive results. We also stopped indexing high school names to reduces false positives.
Bug Fixesπ︎
CV/Resume Parserπ︎
Improved non-English Textkernel-generated candidate summaries.
Fixed bug that caused some jobs to be duplicated when reported on some German CVs.
Fixed an issue where skills and normalization data could be cached for longer than 24 hours.
Document Converterπ︎
Corrected situation where some non-binary data was being detected as binary data and being reported as ovProbableGarbageInText.
Search & Matchπ︎
Fixed an issue applying an upper bound of years experience for a skill.
Fixed an issue filtering multiple document languages in the same transaction.
Fixed an issue where requests would fail when searching or matching across multiple indexes that have documents with the same document id in the results.
Fixed an issue where match results would sometimes be out of order when sorted by the score.
Version 9.1.2π︎
November 13, 2018
Improvementsπ︎
CV/Resume Parserπ︎
7% faster.
Parses all LinkedIn past and present versions extremely accurately.
Better Swedish date parsing.
More accurate employment parsing.
More accurate education parsing.
Improved resume sectioning.
Document Converterπ︎
Better PDF conversions.
Version 9.1.1π︎
November 8, 2018
Improvementsπ︎
CV/Resume Parserπ︎
Fixed management level output for resumes with no current employment.
Search & Matchπ︎
Improve Bimetric Scoring in cases where no second-best taxonomy is found.
Better comparison algorithm for job titles that contain prepositions.
Improved languages matching algorithm.
Version 9.1π︎
October 20, 2018
Improvementsπ︎
CV/Resume Parserπ︎
Greatly improved parsing of gradepoint averages in Education.
Greatly reduced the number of spurious trailing work history jobs or educational schools.
Thousands of improvements to internal data lists.
Vastly improved LinkedIn parsing. We are now able to capture the hidden LinkedIn urls, and ignore the broken partial LinkedIn urls.
Degrees which are just certifications and not intended to be high school-or-higher degrees are now not output in Education, but rather, are output in Certifications.
Better parsing of school names. Fewer school names with City names hanging on the end (sometimes they need to be left that way; other times they need to be stripped β we do both better now).
Better parsing of Russian, Italian, and Norwegian schools and degrees.
Far more accurate nesting of PositionHistory nodes within EmployerOrg nodes: specifically, far fewer wrongful nesting events, and a few more correct nesting events.
We restored and improved the parsing accuracy for BOTH past and present LinkedIn resumes in all known formats.
Improved Company Name and Position Title accuracy by several percentage points. Improved the ability to distinguish between ambiguous elements.
Document Converterπ︎
Better removal of page numbers.
Vastly improved LinkedIn conversions. Conversion to single column format now happens in correct order. Page markers are properly removed. Broken lines are re-connected.
Have real formatted HTML output available from PDFs now.
Improved HTML-to-text conversions. HTML should not contain tabs except within <pre> tags, but some HTML wrongfully does. In the past, these tabs were converted to a single space; now, we convert them to multiple spaces. This ends up allowing the Parser to βseeβ many more section headers that in the past were invisible because they collided with nearby words.
Bug Fixesπ︎
Fixed a bug in the ReservedData section output that would cause an error in scrubbing PII.
We were eliminating some valid URLs. We fixed that so that we now report more URLs.
Version 9.0.2π︎
September 7, 2018
Improvementsπ︎
Better PDF conversions to fix some things intentionally broken by LinkedIn.
Version 9.0.0π︎
May 18, 2018
Upgrade Pathπ︎
If you are upgrading from version 8.0 or later, switching to version 9 is as simple as changing the url of the service from v8 to v9. No other changes needed, typically.
If you are upgrading from version 7.5 or earlier, this version isn't compatible with version 7.5. To upgrade to version 9.0, we recommend the following approach:
- If you're using a parser configuration string, regenerate your string in the new human-readable Name=Value pair format.
- Parse the Sample.doc file (as well as some of your own documents) in the current version you use, and with 9.0 using our Demo Application and save those results to disk.
- Use a document comparison tool to evaluate the differences, specifically the new fields. There is a lot of new metadata provided that could be of high value to integrate in your application. These new fields are detailed below in the New Features section. For a document comparison tool, we really like Beyond Compare.
- Remap your API calls to the new 9.0 methods as described in the API Documentation, make the desired changes to your implementation to leverage the new metadata, change the URL to point to version 9.0, and enjoy.
New Featuresπ︎
Added an endpoint to scrub the Personally Identifiable Information from a Resume/CV. More information can be found in the REST API documentation.
Improvementsπ︎
CV/Resume Parserπ︎
Improved the skills taxonomies for all languages. We added a new taxonomy/Subtaxonomy for all languages: "No dominant taxonomy β Not enough data". When we cannot determine the taxonomy with confidence because so few (or no) skills were found, we output "No dominant taxonomy β Not enough data".
Improved accuracy on Work History and Education.
Improved sectioning of resumes.
Overall accuracy is up about 3 absolute percentage points, with 99% of the previous speed. Textkernel parsing speed is typically at least 5x faster than our nearest competitorβs speed, and we produce about 1/3 to 1/10 of the mistakes as our nearest competitor.
Search & Match Engineπ︎
Improved the handling of management level queries in Matching when there was no management level data in the source document.
Breaking Changesπ︎
Skillsπ︎
We deprecated the SkillsStyle property because we now have a single canonical way and place to output skills.
Skills are now output only in the resume's UserArea, or job's SkillsTaxonomyOutput. The output is extremely easy to read and understand from both a human and programmatic standpoint. The output taxonomies are sorted in descending order of importance, and skills are alphabetical within the subtaxonomies, and child skills are nested within parent skills.
Also, importantly, we now use the English skills list for non-English skills parsing in addition to the detected language's built-in skills list. This will generally result in more skills being found, with very few false cognates.
DO NOT use/rely on the skill Ids that are output. We reserve the right to modify skill names and to preserve the skill Id when we do so. In some cases, we append a language code to skill Ids so that we can output them alongside another translation of that skill with the same Id. If you are relying on skill Ids, stop!
NOTE FOR CUSTOM SKILLS LISTS: When developing your custom skills lists, you must avoid using ANY Textkernel taxonomy or skill Ids. The only way to be certain of that is to prepend or append an alphabetical character to your Ids if they are only integers.
Otherπ︎
We deprecated the ParserSettings.OutputFormat.ReportAllCompanyNamesAndPositionTitlesRegardless and ParserSettings.OutputFormat.ContactMethod properties.
We made these properties read-only:
ParserSettings.OutputFormat.XmlFormatParserSettings.OutputFormat.MinimumCompanyNameProbabilityParserSettings.OutputFormat.MinimumPositionTitleProbability
We moved the Bimetric Score endpoint from /bimetricanalyzer to /scorer/bimetric.
Bug Fixesπ︎
Fixed an uncommon issue in our JSON output where some arrays were output as objects when they ony had a single item.
Fixed an issue where the OutputFormat.NormalizeRegions parser setting was being ignored.