Supported File Formats
Our Parsing, Searching and Matching solutions support file input in virtually any format.
Document Formats
For parsing and indexing, the TK Platform supports more than 70 document formats, including:
- PDF (for scanned PDFs, see Image Formats)
- Microsoft Word (doc, docx, etc.)
- HTML
- WordPad / Rich Text Format (rtf)
- Plain Text (txt)
- OpenOffice Writer (odt)
- Apple iWork Pages
The formats above cover nearly 100% of the documents that we process in practice. If your format is not in the list above, please contact Textkernel support to confirm that it is supported.
Archive Formats
For CVs/Resumes only, the following archive formats are supported by the TK Platform:
- ZIP
- EML
One of the documents contained within the archive is automatically identified as the CV/Resume and parsed. An archive may contain at most 10 files.
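The 10-file limit can be checked on the client side before a ZIP archive is submitted. The following is a minimal sketch, not part of the TK Platform API: the function names are hypothetical, and it assumes that directory entries do not count toward the limit, which the text above does not specify.

```python
import io
import zipfile

# Limit stated in the documentation above.
MAX_ARCHIVE_FILES = 10


def count_zip_files(data: bytes) -> int:
    """Count regular files in a ZIP archive held in memory.

    Assumption: directory entries are not counted as files.
    """
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        return sum(1 for info in archive.infolist() if not info.is_dir())


def zip_within_limit(data: bytes, limit: int = MAX_ARCHIVE_FILES) -> bool:
    """Return True if the archive holds no more than `limit` files."""
    return count_zip_files(data) <= limit
```

A caller could read the archive with `open(path, "rb").read()` and reject or repackage it locally when `zip_within_limit` returns False.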
Image Formats
For CVs/Resumes only, the following image or scanned inputs are supported through an optional OCR (Optical Character Recognition) add-on:
- PDF (image)
- BMP
- JPG
- GIF
- PNG
- TIFF
Please note that images take considerably longer to parse due to the additional OCR step, which is computationally expensive.
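Because image inputs require the OCR add-on and parse more slowly, it can help to sort files into the categories above before submission. The sketch below does this by file extension only. It is an illustration, not Textkernel functionality: the `classify` function and the extension sets are assumptions (for example `.htm`, `.jpeg`, `.tif` and `.pages` are guessed spellings of the listed formats), and an extension cannot reveal whether a PDF is scanned.

```python
from pathlib import Path

# Extension sets are assumptions derived from the format lists above.
DOCUMENT_EXTS = {".pdf", ".doc", ".docx", ".html", ".htm",
                 ".rtf", ".txt", ".odt", ".pages"}
ARCHIVE_EXTS = {".zip", ".eml"}
IMAGE_EXTS = {".bmp", ".jpg", ".jpeg", ".gif", ".png", ".tif", ".tiff"}


def classify(filename: str) -> str:
    """Return 'document', 'archive', 'image' or 'unknown' for a file name.

    Note: a scanned PDF is reported as 'document' because the extension
    alone does not show whether OCR is needed.
    """
    ext = Path(filename).suffix.lower()
    if ext in DOCUMENT_EXTS:
        return "document"
    if ext in ARCHIVE_EXTS:
        return "archive"
    if ext in IMAGE_EXTS:
        return "image"
    return "unknown"
```

Files reported as "unknown" are not necessarily unsupported, since the platform accepts more than 70 document formats; they are simply outside the short lists reproduced here.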