> For the complete documentation index, see [llms.txt](https://docs.extracta.ai/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.extracta.ai/api-reference/supported-file-types.md).

# Supported File Types

At Extracta LABS, we understand the importance of versatility in document extraction processes. Our platform is engineered to accommodate a broad spectrum of document types, ensuring that you can seamlessly integrate our solutions into your workflow, regardless of the document formats you work with.

## Comprehensive Format Support

Our AI-powered extraction technology is designed to handle documents in various formats, including image files, PDFs, and Microsoft Word documents. This capability ensures that Extracta LABS can meet your needs, whether you're processing scanned documents, digital files, or editable documents.

## Currently Supported Formats:

* **Image Files**: Ideal for scanned documents, photographs of documents, and screenshots (.jpeg, .jpg, .png, .tiff, .bmp).
  * ```
    image/jpeg
    ```
  * ```
    image/jpg
    ```
  * ```
    image/png
    ```
  * ```
    image/tiff
    ```
  * ```
    image/bmp
    ```
* **PDF**: Suitable for digital documents that maintain their formatting across different platforms (.pdf).
  * ```
    application/pdf
    ```
* **Microsoft Word Document:** Perfect for editable text documents created in Microsoft Word (.docx, .doc)
  * ```
    application/msword
    ```
  * ```
    application/vnd.openxmlformats-officedocument.wordprocessingml.document
    ```
* **Text Files**: Essential for parsing data from plain text files, facilitating straightforward text extraction without formatting complexities (.txt)
  * ```
    text/plain
    ```

## Processing Capabilities

Extracta's sophisticated document parsing technology not only supports a wide range of file types but also ensures high accuracy in data extraction. Our platform leverages advanced Optical Character Recognition (OCR) techniques to extract text and data efficiently, even from complex document layouts.

## Getting Started

To begin extracting data from your documents, simply upload your files in one of the supported formats via our API. For detailed instructions on how to upload your documents and create extraction requests, please refer to our API documentation.

By supporting multiple document formats, Extracta LABS aims to provide a flexible and comprehensive solution for your document extraction needs. Whether you're working with printed material, digital documents, or editable files, our platform is equipped to deliver precise and reliable extraction results.

{% content-ref url="/pages/Kr8TtNYErc3zJQQ68V7K" %}
[1. Create extraction](/data-extraction-api/api-endpoints-data-extraction/1.-create-extraction.md)
{% endcontent-ref %}