Release notes archive
Version 9.16.7π︎
December 4, 2023
Changesπ︎
Preparing for rebrand to 'Tx Platform'. Users do not need to make any changes. No disruption in service is expected.
Version 9.16.6π︎
November 30, 2023
Changesπ︎
Preparing for rebrand to 'Tx Platform'. Users do not need to make any changes. No disruption in service is expected.
Version 9.16.4π︎
November 16, 2023
New Featuresπ︎
CV/Resume Parser FlexRequests (Beta)π︎
Powered by our LLM Parsing Engine, all CV/Resume Parser requests can now include FlexRequests. FlexRequests allow you to ask any question about a resume. More details can be found here.
Improvementsπ︎
CV/Resume & Job Parserπ︎
For Skills V1, removed dangerous two-letter skills such as AP (Accounts Payable or Andhra Pradesh) and AR (Accounts Receivable or Arkansas).
Do not output a language competency when the CV contains things like serviced Dutch clients
, etc.
No longer output country name as part of the work history description in certain cases.
Recognize National Senior Certificate
as an educational degree.
Version 9.16.3π︎
October 26, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed bug that would cause a 500
status code to be returned when parsing some Hebrew, Japanese, and Turkish documents.
Version 9.16.2π︎
October 25, 2023
Improvementsπ︎
CV/Resume & Job Parserπ︎
Increased maximum file size limit to 16MB.
LLM Parsing Engine (Beta)π︎
Greatly improved the Professional Summary using the power of our LLM Parsing Engine.
Now giving more clear error message returned alongside 429 status codes.
Version 9.16.1π︎
October 12, 2023
Improvementsπ︎
LLM Parsing Engine (Beta)π︎
Increased amount of skills found during parsing.
Bug Fixesπ︎
LLM Parsing Engine (Beta)π︎
Fixed issue causing duplicate Profession classes to be output.
Fixed issue not correctly calculating total months experience and last used date for skills.
Version 9.16.0π︎
October 2, 2023
New Featuresπ︎
CV/Resume Parserπ︎
The LLM Parsing Engine is now available as a beta feature. It can be enabled by setting the UseLLMParser property on resume parsing requests. More details can be found here.
Version 9.15.3π︎
September 7, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Added many new V2 Skills.
Version 9.15.2π︎
August 25, 2023
Bug Fixesπ︎
All Componentsπ︎
Stability improvements.
Version 9.15.1π︎
August 24, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Improved parsing for rare edge cases in LinkedIn profiles, such as when a company name and dates are split across pages.
Version 9.15.0π︎
August 15, 2023
Improvementsπ︎
Search & Matchπ︎
Search now supports results past 1000 documents. For more information, reach out to sales@textkernel.com.
Improved ranking of search results when using skill:()
or title:()
clauses in the SearchExpression field.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed an issue with Portuguese LinkedIn PDF work history parsing.
Version 9.14.8π︎
August 1, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Skills normalization is now supported for Turkish CVs.
Bug Fixesπ︎
CV/Resume Parserπ︎
In V9, fixed skills output for Hebrew, Japanese, and Turkish CVs when the skill was not found in a particular section of the CV.
Corrected the Profession Group IDs to match the values used in the Skills Intelligence APIs.
Job Parserπ︎
Corrected the Profession Group IDs to match the values used in the Skills Intelligence APIs.
Version 9.14.7π︎
July 19, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Improved certification parsing when the description includes the word 'Certification'.
Better city/state/country parsing when no postal code or street address is included, especially for locations in US and India.
Now recognize VTU, VIT, and ISB as universities.
Other minor improvements for education and location parsing in India.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed issue in LinkedIn PDFs where some descriptions of time periods could be mistaken for a work history entry.
Version 9.14.6π︎
June 30, 2023
Bug Fixesπ︎
All Componentsπ︎
Stability improvements.
Version 9.14.5π︎
June 21, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Fixed bug when parsing education on Hebrew, Japanese, and Turkish documents.
Version 9.14.3π︎
June 7, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed bug where Employer Locations were not being output on Hebrew, Japanese, and Turkish documents.
Minor fix for LinkedIn profiles with very long company names.
Version 9.14.2π︎
May 25, 2023
Bug Fixesπ︎
Job Parsingπ︎
Fixed bug where sending images caused an unhandled exception. Now the API response will be 422 - Unprocessable Entity
.
Version 9.14.1π︎
May 17, 2023
New Featuresπ︎
Download Indexed Documentsπ︎
You can now download a list of all documents in your indexes via the Tx Console. Review the Standard Transaction Costs for pricing. The download includes 2 columns - the document id and the document's revision date.
Bug Fixesπ︎
Job Parsingπ︎
Fixed bug where the Required field was always output as false
when using V2 Skills.
Version 9.14.0π︎
May 11, 2023
Improvementsπ︎
Job Parsing Outputπ︎
Significantly improved job parsing accuracy and greatly reduced error rates for almost all output fields.
Added many new fields to the parsing output. Some languages do not include these fields in the output. See this document for additional details.
- Job Description
- Job Requirements
- Benefits
- Employer Description
- Start Date
- Salary
- Minimum Working Hours
- Maximum Working Hours
- Working Hours
- Is Remote
- Drivers License Requirements
- Local Education Level
- Employment Type
- Contract Type
- Application Description
- Contact Person
- Contact Phone
- Contact Email
- Website
- Application Deadline
- Posted Date
- Reference Number
Version 9.13.5π︎
May 4, 2023
Improvementsπ︎
CV/Resume & Job Parserπ︎
Added many new V2 Skills.
V2 Skills are now available in the following additional languages: Croatian, Czech, Greek, Hungarian, Romanian, Russian, Slovak, Slovenian, and Turkish.
Version 9.13.4π︎
April 14, 2023
Improvementsπ︎
Search & Matchπ︎
Minor stability improvements during indexing.
Version 9.13.3π︎
April 13, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Minor stability improvements.
Version 9.13.2π︎
March 30, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Major improvements to candidate name parsing, especially in cases where the name appears in an email address or LinkedIn URL.
Polish LinkedIn CVs now parse correctly.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed an issue where sometimes GPA would be picked up as a date in the education section.
Some minor runtime bug fixes and rare timeout fixes.
Search & Matchπ︎
Fixed the 'check if document exists' endpoint to return 404 - Not Found
if the document does not exist.
Version 9.13.1π︎
March 9, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Decreased time to timeout parsing on Turkish, Hebrew, and Japanese documents. A timeout in these languages will return a 408 - Request Timeout
response.
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Search & Matchπ︎
Fixed bug giving 500
when trying to index a resume without metadata. A 400
status code will now be returned instead.
Version 9.13.0π︎
March 2, 2023
Transaction Cost Updatesπ︎
This release introduces an improvement in the transaction cost structure for accounts with Search & Match enabled. Following this change, you will be able to parse more documents using the same number of credits, when you choose not to index all of the parsed documents.
The transaction cost structure will be changed as follows:
Action | Old base transaction cost | New base transaction cost |
---|---|---|
Parse resume or job | 2 credits when matching is enabled, else 1 | 1 credit |
Index or update a full resume or job | 0 credits | 1 credit |
Update tags for an already indexed document | 0 credits | 0 credits |
Please note that the base transaction cost does not cover additional services such as geocoding and normalization of professions and skills. The cost for these add-ons will remain unchanged. You can see full details regarding transaction costs here.
Improvementsπ︎
Search & Matchπ︎
Documents parsed without Search & Match enabled can now later be indexed and used for matching/searching.
CV/Resume Parserπ︎
Improved name parsing on English resumes from Indian locales.
Bug Fixesπ︎
Search & Matchπ︎
Fixed bug sometimes outputting NaN
for SuggestedCategoryWeights
on bimetric scoring responses.
Version 9.12.3π︎
February 16, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
For Portuguese resumes, correctly output Portuguese as the 'mother tongue' instead of Spanish.
Hebrew, Japanese and Turkish resumes now correctly output the FoundMonth
and FoundDay
fields on Dates.
Fixed various issues where parsing didn't properly timeout.
Other minor runtime bug fixes.
Version 9.12.1π︎
February 1, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
Stability improvements.
Version 9.12.0π︎
January 26, 2023
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Version 9.11.4π︎
January 18, 2023
Improvementsπ︎
CV/Resume Parserπ︎
Improved accuracy for parsing names/locations when the language is English and the locale is India.
Bug Fixesπ︎
CV/Resume Parserπ︎
Output correct type for certain web addresses, such as GitHub.
For Swedish resumes, do not output Danish as the 'mother tongue'.
Fixed some rare runtime bugs in the parser.
Job Parserπ︎
Do not consider languages found in certain sections of a job order as 'required'.
Search & Matchπ︎
Fixed an issue where specifying geocoordinates during job parsing caused an error.
Handle certain whitespace in index names instead of causing an error.
Resumes with a very large number of skills no longer cause an error during matching.
Version 9.11.2π︎
December 15, 2022
Bug Fixesπ︎
Document Converterπ︎
Fixed an issue where text files would not generate HTML when OutputHtml was set to true
.
Version 9.11.1π︎
December 14, 2022
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed regression with LinkedIn Profile PDF parsing that was introduced in v9.11.0.
Version 9.11.0π︎
December 12, 2022
New Featuresπ︎
CV/Resume Parserπ︎
Japanese and Turkish are now supported languages. See this document for additional details.
CV/Resume & Job Parserπ︎
Significantly improved multi-column and PDF text conversion resulting in general parsing improvements across the board.
Added version selection to professions normalization for taxonomies that offer multiple versions. ONET "2010"
is deprecated. Use "2019"
instead. See ONET Version for more details.
Improvementsπ︎
CV/Resume Parserπ︎
Major improvements for Hebrew parsing.Low usage fields are no longer output for this language.
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Fixed various issues where parsing didn't properly timeout.
Version 9.10.4π︎
November 28, 2022
Improvementsπ︎
CV/Resume Parserπ︎
Improved recognition of university names.
Report Email-myname@company.com as myname@company.com
instead of Email-myname@company.com
and related scenarios.
Version 9.10.3π︎
November 22, 2022
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed runtime parsing error.
Fixed various scenarios that rarely caused requests to take more than 25s.
Version 9.10.1π︎
November 10, 2022
Bug Fixesπ︎
Search & Matchπ︎
Fixed error where unrelated job titles were being considered a variation of each other. This was only an issue in the matching logic and had no impact on parsing and doesn't require any reindexing.
Version 9.10.0π︎
November 03, 2022
Improvementsπ︎
CV/Resume Parserπ︎
Improve name detection when split across multiple lines
Added support for NIH Stroke Scale
certification
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed detection of employment history when denoted by Current Employer
header.
Reduce reporting of false positive company names when a company isn't specified.
Reduce false positive certifications from being reported.
Minor runtime bug fixes.
Version 9.9.1π︎
October 19, 2022
Bug Fixesπ︎
Job Parserπ︎
Fixed bug ouputting V2 Skills in Version 10.
SOAP (deprecated)π︎
Fixed an issue that would return an invalid URL when generating a new service proxy.
Fixed bug that would return errors when using SOAP version 1.2.
Version 9.9.0π︎
October 11, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Speed improvements.
Added many new V2 Skills.
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Version 9.8.1π︎
August 3, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better LinkedIn URL parsing when URL is split across multiple lines.
Improved parsing of Danish LinkedIn PDFs.
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Fixed redaction of candidate first name from candidate summary when using V2 skills.
Search & Matchπ︎
Fixed error that would occur when filtering by Job Titles and including a null Settings parameter in the request.
Version 9.8.0π︎
July 18, 2022
Enhancementsπ︎
CV/Resume & Job Parserπ︎
Major upgrade of the Skills Taxonomy (V2)π︎
-
Upgrade for freeto Skills V2. Benefits of V2 include:
- 2x larger skills taxonomy, updated frequently based on real-world data.
- 15-40% higher accuracy of extracted skills.
- Ability to use Skills Normalization (see below).
- When using V2, the candidate summary will no longer contain skills information unless Skills Normalization is also enabled (see below)
Enable Skills Normalization to benefit from better skills classificationπ︎
- New! Skill types are output (IT/Professional/Soft Skills)
- Enhances the candidate summary with structured skills information
- Better clustering of skill synonyms
- Compatibility with the taxonomy used in Textkernel's Skills Intelligence APIs and Jobfeed, enabling standardization of taxonomies across all of your data and benchmarking against jobs posted online.
- Consumes 0.1 additional credits per transaction
New Featuresπ︎
CV/Resume & Job Parserπ︎
Normalization of Job Titlesπ︎
Enable Professions Normalization to standardize job titles according to a taxonomy.
- For Resumes, the most recent 3 job titles are normalized. View the new output properties here.
- Supported Taxonomies: Textkernel, ISCO 2008, O*NET 2010. ONET 2019 will be added soon.
- The professions taxonomy and the mappings are compatible with the taxonomies used in Textkernel's Skills Intelligence APIs and Jobfeed, enabling standardization of taxonomies across all of your data and benchmarking against jobs posted online.
- Supported languages: English, German, French, Spanish, Italian, Dutch, Japanese, Simplified Chinese, Portuguese, Polish
- Consumes 0.2 additional credits per transaction
Search & Matchπ︎
Normalization of Job Titlesπ︎
Get more search and match results by extending the query beyond exact job titles.
-
Normalized job titles are automatically included in Match queries
- Requires the candidates and jobs to be parsed and indexed with Professions Normalization enabled
-
Manual search queries can be normalized in real-time through the API (see here)
-
Consumes 0.2 additional credits per transaction
Version 9.7.5π︎
May 4, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Added some new IT skills.
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Fixed a few minor runtime errors.
Search & Matchπ︎
Better handling of invalid Index and Document IDs.
Version 9.7.4π︎
April 6, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Added many new IT skills.
Removed some job titles from the skills data. Users should use the Configuration parameter to output job titles as skills.
Small improvement to Awards section header recognition.
Bug Fixesπ︎
Search & Matchπ︎
Fixed SearchExpression validation that caused occasional search expressions to return a bad request. This occured when the search expression was only a preposition and used the title:() syntax.
Version 9.7.3π︎
March 23, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better detection/cleanup of HTML-encoded entities in text that is only partially HTML.
Small improvement to Certifications section header recognition.
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Fixed a problem where some LinkedIn profiles would be truncated on certain words/phrases.
Matching UIπ︎
Do not require the user to login if the email address (Username) contained uppercase characters.
Version 9.7.2π︎
February 10, 2022
Improvementsπ︎
CV/Resume & Job Parserπ︎
Improved municipality and region parsing in all languages/locales.
Better date alignment, and therefore work history parsing, for some LinkedIn profiles with 2+ positions at one employer.
Improved de-columnization for LinkedIn profiles with only 1 position listed. This greatly improves parsing for these profiles.
Resume Formatterπ︎
Skill tags are now a neutral color.
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Don't allow Japanese punctuation in an English resume to cause the language to be reported as Japanese.
[v10 only] Fixed a mapping issue that was causing a DateOfBirth
of 0001-01-01
to be output when no birth date was found.
Eliminated rare cases where setting OutputHtml = true
was causing the parsing request to hang.
Resume Formatterπ︎
No longer output company email/phone/etc if those are not provided in the API request.
Fixed problem where not setting a footer caused the output document to be malformed.
Version 9.7.1π︎
December 3, 2021
New Featuresπ︎
CV/Resume & Job Parserπ︎
Hebrew is now a supported language, and Israel is a supported locale.
Version 9.7.0π︎
November 18, 2021
Improvementsπ︎
Search & Matchπ︎
/searcher
endpoint now returns results sorted by a relevancy score instead of simply revision date.Learn more.
Regions in Australia are now normalized during indexing and querying.
Bug Fixesπ︎
Search & Matchπ︎
Fixed indexing documents where address contains multiple regions.
When an address is specified during geocoding, replace the address on the parsed document so the new address can be indexed along with the latitude and longitude. This can then be used for exact location filters.
Version 9.6.9π︎
November 5, 2021
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better LinkedIn parsing in multiple languages.
Eliminated some minor runtime bugs.
Improved location parsing.
Better Hungarian education parsing.
Version 9.6.8π︎
September 22, 2021
Improvementsπ︎
CV/Resume & Job Parserπ︎
Internal query and index optimizations.
Improved parsing of social media handles and URLs and monikers.
Better inference of Swedish date of birth from national identity number.
Improved robustness of PII scrubbing.
Improved LinkedIn parsing.
Reporting more accurate certifications, and fewer certification variants.
Improved Work History parsing for all languages, including date parsing.
Improved Russian language location parsing, sectioning, and Work History parsing.
Version 9.6.7π︎
August 9, 2021
Improvementsπ︎
Search & Matchπ︎
Internal query and index optimizations.
Document Converterπ︎
Improved image extraction/processing for large images.
Fixed timeouts in some documents.
Version 9.6.6π︎
August 3, 2021
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Fixed several runtime bugs.
Version 9.6.5π︎
July 27, 2021
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better parsing for personal information.
Improved sectioning of resumes, and identification of more education sections.
Better reordering of multi-column resumes.
Better parsing of LinkedIn resumes.
More accurate work history parsing, primarily in English, Chinese, German, Dutch, and Italian.
More accurate contact information parsing, especially for India and Czech and Norwegian languages.
More accurate education parsing, especially in French and Dutch.
Improved parsing for Romanian language.
Added more known certifications and personal qualifications.
Document Converterπ︎
Better detection/extraction for candidate images.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed runtime bug related to anonymizing certain resumes.
Version 9.6.3π︎
June 30, 2021
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better location parsing for Pakistan.
Improved LinkedIn parsing to support the latest LinkedIn PDF changes.
Recognize the following online education websites:
- Udemy
- Coursera
- Pluralsight
- Udacity
- CodeAcademy
- Skillshare
- edX
- DataCamp
- Treehouse
Better error handling.
Now we return an HTTP Status Code 413 - Payload Too Large
for resumes or jobs that are too large to be parsed. Previously, these files would return 422 - Unprocessable Entity
. The file size limit is ~6MB, which becomes ~8MB after Base64-encoding has been done. Files larger than this are typically not resumes/jobs, or are scanned images of resumes/jobs which Textkernel does not support. Scanned images need to be OCR'ed prior to sending to Textkernel to parse.
Version 9.6.2π︎
June 25, 2021
Improvementsπ︎
Document Converterπ︎
Better candidate image recognition to eliminate many false-positives.
CV/Resume & Job Parserπ︎
Greatly enhanced support for traditional Chinese language.
Improved candidate name and qualification suffixes parsing.
LinkedIn parsing enhancements.
Better Australian locale parsing.
Improved school parsing for Netherlands and Australia.
Better parsing for Canada, Portugal, China, Hungary, and Czech Republic.
Improved Contact Info, Work History, and Education parsing across all languages.
Better parsing for US addresses with no street address or postal code.
Added support for the following social media usernames/handles in Contact Info:
- GitHub
- Skype
- Stack Overflow
- Signal
- Quora
- ICQ
- Telegraph
- Telegram
- MeWe
- Parler
- Gab
Simplified and improved candidate summaries across all languages.
Improved support for multi-column resumes.
Improved Job Level categorization in position history output.
Bug Fixesπ︎
Document Converterπ︎
Replaced functionality in third-party converter that was causing a memory leak when processing some PDFs when OutputCandidateImage
was set to true.
Sourcingπ︎
Fixed a runtime bug when purchasing resumes through CareerBuilder.
Version 9.6.1π︎
March 25, 2021
Improvementsπ︎
CV/Resume & Job Parserπ︎
Much better parsing for two-column resumes, including resumes that are only partially two-column.
Improved detection/parsing of date ranges that are split into a vertical multi-line section within an otherwise horizontal text section.
Better contact info parsing. We now report the best-formatted phone number when multiple versions of the same number are present in the resume. We no longer report fax numbers because it is 2021, not 1991. We no longer report work phone numbers when there are several other non-work phone numbers.
Improved recognition of stacked person names (names broken into words on separate lines).
We no longer report 'Politics', which was only active for Chinese resumes because we feel it is immoral.
Better education parsing. This is the most significant advance in education parsing in several years. Fewer false cognate school entries, better degree names, and much better major names. Also all certificates (except in certain locales such as Australia) have been moved to the Certifications section since they are really not the same as degreed education.
Far better recognition/handling of job nesting (or not) by company name and dates.
Improved accuracy on all aspects of employment parsing: company name, job title, location, and dates.
Improved parsing for LinkedIn resumes, especially for some non-English languages.
Improved parsing for Indeed's new resume format, which include an Assessments section.
Skills now generally ouput as Proper-case rather than UPPERCASE.
Improved accuracy across all data types in all languages, but especially so in English, German, Italian, Dutch, French, Russian, Hungarian, Czech, and Swedish.
Document Converterπ︎
Improved PDF Conversion.
Upgraded 3rd party software.
Bug Fixesπ︎
CV/Resume & Job Parserπ︎
Fixed various reported bugs.
Document Converterπ︎
Fixed a bug that was causing some blank HTML to be output on some corrupted/no-text documents
Version 9.5.2π︎
December 15, 2020
Improvementsπ︎
Document Converterπ︎
Improved HTML Conversion.
Bug Fixesπ︎
Search & Matchπ︎
Minor runtime bug fixes.
Version 9.5.1π︎
October 9, 2020
New Featuresπ︎
CV/Resume & Job Parserπ︎
Added initial support for Indonesian culture.
Improvementsπ︎
CV/Resume Parserπ︎
More accurate Contact Info, Education and Employment parsing.
Speed improvements.
Minor bug fixes.
Document Converterπ︎
Improvements to selection of correct conversion strategy.
Version 9.5π︎
September 3, 2020
New Featuresπ︎
Search & Matchπ︎
Scoring and Default Sortingπ︎
We have developed a new score (SovScore) that is derived from the combination of the WeightedScore and the Reverse Compatibility Score (RCS). Blending these two scores together using a proprietary algorithm allows us to provide a single score that blends results from both directional perspectives of the match. This new SovScore results in better matches rising to the top of the result set, while matches that score highly only from a single directional perspective will be scored lower.
All match results returned by the API, and in Textkernelβs own Search & Match UI, are sorted in descending order by the SovScore property. Please note that you are always free to use your own custom sorting or postprocessing of the returned results before displaying them in your own UI.
Transparencyπ︎
In an effort to continually increase the industry-leading transparency of our Search & Match engine, we have added new metadata to the match response. We added an EnrichedData property for each direction of the match. This object contains (a) the terms that were found, (b) the terms that were not found, and (c) explanatory messages (in English) about each data point. For example, in the skills object you might see that the skill 'java' was found, and in the corresponding explanation it might state that while 'java' was found in the document as a skill, it wasn't used by the candidate recently.
Improvementsπ︎
CV/Resume & Job Parserπ︎
Improved coverage for certain CAD/CAM skills.
Version 9.4.10π︎
August 26, 2020
Improvementsπ︎
CV/Resume & Job Parserπ︎
Better nesting of jobs based on employer names.
Better phone number parsing (eliminated some false cognates).
Better work history parsing in Dutch, Russian, Italian, French, and Greek.
Improved coverage of the cloud computing sub taxonomy.
Document Converterπ︎
Can convert more documents to HTML, whereas some conversions previously failed.
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Version 9.4.9π︎
August 6, 2020
Improvementsπ︎
CV/Resume & Job Parserπ︎
Improved rejection of false positive addresses and phone numbers.
Substantial improvements to skills taxonomy weights (expressed in output as percentages).
Better parsing for Dutch, Hungarian, and Croation languages.
Better accuracy in all sections.
9% faster.
Document Converterπ︎
Better conversion to HTML and PDF.
Some files that could not be converted to HTML or PDF because of internal errors in the document can now be converted to HTML or PDF.
Version 9.4.8π︎
June 23, 2020
Improvementsπ︎
CV/Resume & Job Parserπ︎
Remove some spurious HTML tags in plain text before parsing.
Better certifications parsing.
Better location parsing in all languages.
More accurate education parsing in all languages.
Bug Fixesπ︎
CV/Resume Parserπ︎
Minor runtime bug fixes.
Version 9.4.7π︎
May 29, 2020
Bug Fixesπ︎
Job Parserπ︎
Fixed a language identification bug when parsing non-English jobs.
Version 9.4.6π︎
May 22, 2020
Bug Fixesπ︎
Document Converterπ︎
Fixed a rare HTML conversion issue.
Search & Matchπ︎
Fixed various request validation bugs for malformed requests.
Version 9.4.5π︎
May 13, 2020
Improvementsπ︎
CV/Resume Parserπ︎
Improved LinkedIn profile parsing.
Version 9.4.3π︎
May 1, 2020
Improvementsπ︎
CV/Resume Parserπ︎
Better Hungarian parsing.
Better parsing of certifications.
Better parsing of educational majors.
Fewer false positives on educational GPAs.
Generally more accurate.
Fixed some minor runtime bugs.
Search & Matchπ︎
Improved the handling of punctuation in full-text searching.
Bug Fixesπ︎
Search & Matchπ︎
Fixed edge case for handling bad input to CategoryWeights
.
Version 9.4.2π︎
March 17, 2020
Improvementsπ︎
CV/Resume Parserπ︎
Data improvements.
Document Converterπ︎
Improved conversion of PDF to HTML.
More accurate detection of LinkedIn PDFs.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed two runtime bugs.
Version 9.4.1π︎
March 11, 2020
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed several runtime bugs.
Job Parserπ︎
Additional runtime bug fixes.
Version 9.4.0π︎
March 4, 2020
Improvementsπ︎
CV/Resume Parserπ︎
Better LinkedIn parsing.
World's fastest resume parsing is now substantially faster.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed several runtime bugs.
Version 9.3.10π︎
February 14, 2020
Improvementsπ︎
CV/Resume Parserπ︎
Better LinkedIn parsing.
Better email parsing.
Better Chinese parsing.
Improved job title parsing for Bulgarian, Estonian, Finnish, Croatian, Hungarian, Lithuanian, Latvian, Polish, Romanian, Slovak, and Slovenian.
Improved education parsing for Indonesian school names.
Improved handling of company names which are actually URLs.
Improved handling of invalid dates such as February 29, 2007 (2007 was not a leap year).
Added a new configuration option to output all Metadata in English regardless of the resume language. Set "OutputFormat.AllSummariesInEnglish = true;"
in the configuration string.
Job Parserπ︎
Improved parsing for years of experience requirements.
Document Converterπ︎
Better PDF Conversions.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed several runtime bugs.
Fixed bug that caused some jobs to be duplicated when reported on some German CVs.
Version 9.3.9π︎
November 21, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Better Work History parsing.
Improved LinkedIn profile parsing.
Document Converterπ︎
Performance optimizations.
Version 9.3.8π︎
November 19, 2019
Improvementsπ︎
SaaS Servicesπ︎
System performance optimization.
Version 9.3.7π︎
November 8, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Better Swedish work history parsing.
Better parsing for Educational History in all languages.
Improved skills taxonomy.
Improved LinkedIn profile parsing for location and street level address information.
Improved LinkedIn profile URL parsing.
Improved Candidate Name suffix parsing.
Improved recognition of some obscure date formats, primarily for European resumes in a language other than English.
Improved Chinese language parsing accuracy.
Improved accuracy of school names and degrees.
Document Converterπ︎
Improved performance for converting documents to HTML.
Reduced timeouts on Excel documents.
Improved LinkedIn profile text conversion for the latest version of LinkedIn profiles.
SaaS Servicesπ︎
Improved performance for all SaaS Service endpoints.
Version 9.3.6π︎
September 30, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Improved LinkedIn profile parsing.
Added new skills.
Improved Employment parsing for Swedish resumes.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed a runtime bug.
Version 9.3.5π︎
September 26, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Added parsing support for UK academic qualifications such as GCSE, BTEC, NVQ, and DipCG.
Search & Matchπ︎
Enhanced the robustness of the matching engine.
Added support for non-integer taxonomy ids for Match by Criteria requests.
Version 9.3.4π︎
September 12, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Better logic for two-character skills to prevent reporting erroneous data as a skill.
More accurate Company Name parsing.
Better Italian Employment History parsing.
More accurate education parsing.
Up to 20x faster on some resumes, and about 9% faster overall.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed several runtime bugs that were tested to occur about once every million documents.
Version 9.3.3π︎
August 28, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Improved parsing of LinkedIn profiles in Spanish.
Improved the Experience Summary output when no dominant taxonomy was found.
Improved phone number parsing.
Document Converterπ︎
Improved detection and conversion of reversed text in poorly constructed PDF documents.
Search & Matchπ︎
Diacritics (e.g. Γ©) can now be used in index names and document IDs.
Version 9.3.2π︎
August 14, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Major improvements to Employment parsing by changing the logic used for nesting of jobs by date ranges. The logic was improved all around, meaning that more jobs that should be nested do get nested, and more jobs that should not be nested don't get nested.
More accurate LinkedIn parsing.
Bug Fixesπ︎
CV/Resume Parserπ︎
Fixed several runtime bugs that were tested to occur about once every million documents.
Document Converterπ︎
Fixed a condition that would report a file as corrupt rather than having too-short-to-be-believed lines.
Version 9.3π︎
July 29, 2019
New Featuresπ︎
CV/Resume Parserπ︎
Added full support for the following languages/locales:
- Bulgarian
- Croatian
- Estonian
- Finnish
- Hungarian (was previously partially supported)
- Latvian
- Lithuanian
- Polish
- Romanian
- Slovak
- Slovenian
Job Parserπ︎
Added full support for the following languages/locales:
- Bulgarian
- Croatian
- Estonian
- Finnish
- Hungarian (was previously partially supported)
- Latvian
- Lithuanian
- Polish
- Romanian
- Slovak
- Slovenian
Search & Matchπ︎
Added full support for the following languages/locales:
- Bulgarian
- Croatian
- Estonian
- Finnish
- Hungarian (was previously partially supported)
- Latvian
- Lithuanian
- Polish
- Romanian
- Slovak
- Slovenian
Bug Fixesπ︎
Search & Matchπ︎
Fixed a bug where setting MatchCriteria.MonthsManagementExperience = null
would cause an error.
Version 9.2π︎
July 18, 2019
Improvementsπ︎
CV/Resume Parserπ︎
Accuracy improvements in all languages.
More accurate contact info parsing. Better phone number type classification.
More accurate LinkedIn parsing.
More accurate Employment parsing.
More accurate Skills parsing with new terms.
More accurate Education parsing. Far more accurate degree types in every language.
Added support for Resume Quality codes 121-124 for resumes from UK and NZ
Better parsing of Colombian national identity numbers.
Approximately 3%-14% faster than previous release.
Job Parserπ︎
Parsing throughput has increased over 50%.
Document Converterπ︎
Better detection and correction of reversed text.
More useful conversion of LinkedIn pdf profiles in the varied two-column formats.
Faster conversion of PDF to HTML.
Better trimming of leading whitespace preceding first non-whitespace character in converted text.
When documents take too long to convert, we now return converted text in many more instances, whereas before we returned no text.
Improved messaging for possible conversion errors.
New Output Validity: ovLinesSeemTooShort.
Added messages for output validity warnings and errors.
Upgraded third party DLL versions:
- Aspose Words 19.4
- Aspose PDF 19.4
- dtSearch 7.93.8596.18093
Search & Matchπ︎
We significantly improved normalization of school names behind the scenes, so that searches on school names will return more comprehensive results. We also stopped indexing high school names to reduces false positives.
Bug Fixesπ︎
CV/Resume Parserπ︎
Improved non-English Textkernel-generated candidate summaries.
Fixed bug that caused some jobs to be duplicated when reported on some German CVs.
Fixed an issue where skills and normalization data could be cached for longer than 24 hours.
Document Converterπ︎
Corrected situation where some non-binary data was being detected as binary data and being reported as ovProbableGarbageInText.
Search & Matchπ︎
Fixed an issue applying an upper bound of years experience for a skill.
Fixed an issue filtering multiple document languages in the same transaction.
Fixed an issue where requests would fail when searching or matching across multiple indexes that have documents with the same document id in the results.
Fixed an issue where match results would sometimes be out of order when sorted by the score.
Version 9.1.2π︎
November 13, 2018
Improvementsπ︎
CV/Resume Parserπ︎
7% faster.
Parses all LinkedIn past and present versions extremely accurately.
Better Swedish date parsing.
More accurate employment parsing.
More accurate education parsing.
Improved resume sectioning.
Document Converterπ︎
Better PDF conversions.
Version 9.1.1π︎
November 8, 2018
Improvementsπ︎
CV/Resume Parserπ︎
Fixed management level output for resumes with no current employment.
Search & Matchπ︎
Improve Bimetric Scoring in cases where no second-best taxonomy is found.
Better comparison algorithm for job titles that contain prepositions.
Improved languages matching algorithm.
Version 9.1π︎
October 20, 2018
Improvementsπ︎
CV/Resume Parserπ︎
Greatly improved parsing of gradepoint averages in Education.
Greatly reduced the number of spurious trailing work history jobs or educational schools.
Thousands of improvements to internal data lists.
Vastly improved LinkedIn parsing. We are now able to capture the hidden LinkedIn urls, and ignore the broken partial LinkedIn urls.
Degrees which are just certifications and not intended to be high school-or-higher degrees are now not output in Education, but rather, are output in Certifications.
Better parsing of school names. Fewer school names with City names hanging on the end (sometimes they need to be left that way; other times they need to be stripped β we do both better now).
Better parsing of Russian, Italian, and Norwegian schools and degrees.
Far more accurate nesting of PositionHistory nodes within EmployerOrg nodes: specifically, far fewer wrongful nesting events, and a few more correct nesting events.
We restored and improved the parsing accuracy for BOTH past and present LinkedIn resumes in all known formats.
Improved Company Name and Position Title accuracy by several percentage points. Improved the ability to distinguish between ambiguous elements.
Document Converterπ︎
Better removal of page numbers.
Vastly improved LinkedIn conversions. Conversion to single column format now happens in correct order. Page markers are properly removed. Broken lines are re-connected.
Have real formatted HTML output available from PDFs now.
Improved HTML-to-text conversions. HTML should not contain tabs except within <pre>
tags, but some HTML wrongfully does. In the past, these tabs were converted to a single space; now, we convert them to multiple spaces. This ends up allowing the Parser to βseeβ many more section headers that in the past were invisible because they collided with nearby words.
Bug Fixesπ︎
Fixed a bug in the ReservedData section output that would cause an error in scrubbing PII.
We were eliminating some valid URLs. We fixed that so that we now report more URLs.
Version 9.0.2π︎
September 7, 2018
Improvementsπ︎
Better PDF conversions to fix some things intentionally broken by LinkedIn.
Version 9.0.0π︎
May 18, 2018
Upgrade Pathπ︎
If you are upgrading from version 8.0 or later, switching to version 9 is as simple as changing the url of the service from v8 to v9. No other changes needed, typically.
If you are upgrading from version 7.5 or earlier, this version isn't compatible with version 7.5. To upgrade to version 9.0, we recommend the following approach:
- If you're using a parser configuration string, regenerate your string in the new human-readable Name=Value pair format.
- Parse the Sample.doc file (as well as some of your own documents) in the current version you use, and with 9.0 using our Demo Application and save those results to disk.
- Use a document comparison tool to evaluate the differences, specifically the new fields. There is a lot of new metadata provided that could be of high value to integrate in your application. These new fields are detailed below in the New Features section. For a document comparison tool, we really like Beyond Compare.
- Remap your API calls to the new 9.0 methods as described in the API Documentation, make the desired changes to your implementation to leverage the new metadata, change the URL to point to version 9.0, and enjoy.
New Featuresπ︎
Added an endpoint to scrub the Personally Identifiable Information from a Resume/CV. More information can be found in the REST API documentation.
Improvementsπ︎
CV/Resume Parserπ︎
Improved the skills taxonomies for all languages. We added a new taxonomy/Subtaxonomy for all languages: "No dominant taxonomy β Not enough data". When we cannot determine the taxonomy with confidence because so few (or no) skills were found, we output "No dominant taxonomy β Not enough data".
Improved accuracy on Work History and Education.
Improved sectioning of resumes.
Overall accuracy is up about 3 absolute percentage points, with 99% of the previous speed. Textkernel parsing speed is typically at least 5x faster than our nearest competitorβs speed, and we produce about 1/3 to 1/10 of the mistakes as our nearest competitor.
Search & Match Engineπ︎
Improved the handling of management level queries in Matching when there was no management level data in the source document.
Breaking Changesπ︎
Skillsπ︎
We deprecated the SkillsStyle
property because we now have a single canonical way and place to output skills.
Skills are now output only in the resume's UserArea
, or job's SkillsTaxonomyOutput
. The output is extremely easy to read and understand from both a human and programmatic standpoint. The output taxonomies are sorted in descending order of importance, and skills are alphabetical within the subtaxonomies, and child skills are nested within parent skills.
Also, importantly, we now use the English skills list for non-English skills parsing in addition to the detected language's built-in skills list. This will generally result in more skills being found, with very few false cognates.
DO NOT use/rely on the skill Ids that are output. We reserve the right to modify skill names and to preserve the skill Id when we do so. In some cases, we append a language code to skill Ids so that we can output them alongside another translation of that skill with the same Id. If you are relying on skill Ids, stop!
NOTE FOR CUSTOM SKILLS LISTS: When developing your custom skills lists, you must avoid using ANY Textkernel taxonomy or skill Ids. The only way to be certain of that is to prepend or append an alphabetical character to your Ids if they are only integers.
Otherπ︎
We deprecated the ParserSettings.OutputFormat.ReportAllCompanyNamesAndPositionTitlesRegardless
and ParserSettings.OutputFormat.ContactMethod
properties.
We made these properties read-only:
ParserSettings.OutputFormat.XmlFormat
ParserSettings.OutputFormat.MinimumCompanyNameProbability
ParserSettings.OutputFormat.MinimumPositionTitleProbability
We moved the Bimetric Score endpoint from /bimetricanalyzer
to /scorer/bimetric
.
Bug Fixesπ︎
Fixed an uncommon issue in our JSON output where some arrays were output as objects when they ony had a single item.
Fixed an issue where the OutputFormat.NormalizeRegions
parser setting was being ignored.