Supported File Formats

Our Parsing, Searching and Matching solutions accept input files in a wide range of document, archive and image formats.

Document Formats

For parsing and indexing, the TK Platform supports more than 70 document formats, including:

  • PDF (for scanned PDFs see Image Formats)
  • Microsoft Word (doc, docx, etc.)
  • HTML
  • WordPad / Rich Text Format (rtf)
  • Plain Text (txt)
  • OpenOffice Writer (odt)
  • Apple iWork Pages

The formats above cover nearly 100% of the documents that we process in practice. If your format is not in the list above, please contact Textkernel support to confirm that it is supported.

Archive Formats

For CVs/Resumes only, the following archive formats are supported by the TK Platform:

  • ZIP
  • EML

One of the documents contained in the archive is automatically identified as the CV/Resume and parsed. An archive may contain at most 10 files.
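As a client-side illustration of the 10-file limit, the sketch below counts the entries in a ZIP archive before it is submitted. It uses only Python's standard zipfile module and is not part of the TK Platform; the names MAX_FILES_IN_ARCHIVE, count_zip_files and zip_within_limit are hypothetical. It also assumes that directory entries do not count towards the limit, which the text above does not specify.

```python
import io
import zipfile

# Limit taken from the text above; the constant name is our own.
MAX_FILES_IN_ARCHIVE = 10


def count_zip_files(data: bytes) -> int:
    """Count regular file entries in a ZIP archive held in memory.

    Assumption: directory entries are not counted towards the limit.
    """
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        return sum(1 for info in archive.infolist() if not info.is_dir())


def zip_within_limit(data: bytes, limit: int = MAX_FILES_IN_ARCHIVE) -> bool:
    """Return True if the archive holds no more than `limit` files."""
    return count_zip_files(data) <= limit
```

A caller could read the archive with open(path, "rb").read() and skip or repackage it when zip_within_limit returns False.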

Image Formats

For CVs/Resumes only, the following image or scanned inputs are supported through an optional OCR (Optical Character Recognition) add-on:

  • PDF (image)
  • BMP
  • JPG
  • GIF
  • PNG
  • TIFF

Please note that images take considerably longer to parse due to the additional OCR step, which is computationally expensive.
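To anticipate which inputs will go through the slower OCR path, a client could group files by extension before submitting them. The sketch below is one possible way to do that and is not part of the TK Platform. The function name classify and the extension sets are assumptions: they cover only the formats listed on this page plus a few common spelling variants (htm, jpeg, tif, pages). A scanned PDF cannot be told apart from a text PDF by its extension, so every PDF is reported as a document here.

```python
from pathlib import Path

# Extension sets are assumptions derived from the lists on this page.
DOCUMENT_EXTS = {".pdf", ".doc", ".docx", ".html", ".htm",
                 ".rtf", ".txt", ".odt", ".pages"}
ARCHIVE_EXTS = {".zip", ".eml"}
IMAGE_EXTS = {".bmp", ".jpg", ".jpeg", ".gif", ".png", ".tif", ".tiff"}


def classify(filename: str) -> str:
    """Return 'document', 'archive', 'image' or 'unknown' for a file name.

    Only the extension is inspected; file contents are never read.
    """
    ext = Path(filename).suffix.lower()
    if ext in ARCHIVE_EXTS:
        return "archive"
    if ext in IMAGE_EXTS:
        return "image"
    if ext in DOCUMENT_EXTS:
        return "document"
    return "unknown"
```

Files reported as "image" can then be expected to take longer, and files reported as "unknown" are candidates for a support enquiry before upload.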