# Supported File Types

Extracta LABS is built to accept a broad range of document types, so you can integrate our extraction tools into your workflow regardless of the formats you work with.

## Comprehensive Format Support

Our AI-powered extraction technology handles documents in several formats: image files, PDFs, Microsoft Word documents, and plain text files. This lets Extracta LABS cover scanned documents, digital files, and editable documents alike.

## Currently Supported Formats

* **Image Files**: Ideal for scanned documents, photographs of documents, and screenshots (.jpeg, .jpg, .png, .tiff, .bmp).
  * ```
    image/jpeg
    ```
  * ```
    image/jpg
    ```
  * ```
    image/png
    ```
  * ```
    image/tiff
    ```
  * ```
    image/bmp
    ```
* **PDF**: Suitable for digital documents that maintain their formatting across different platforms (.pdf).
  * ```
    application/pdf
    ```
* **Microsoft Word Documents**: Perfect for editable text documents created in Microsoft Word (.doc, .docx).
  * ```
    application/msword
    ```
  * ```
    application/vnd.openxmlformats-officedocument.wordprocessingml.document
    ```
* **Text Files**: Suitable for plain text files, where text can be extracted directly without formatting complexities (.txt).
  * ```
    text/plain
    ```

## Processing Capabilities

Beyond accepting a wide range of file types, Extracta LABS is designed for high accuracy in data extraction. The platform uses Optical Character Recognition (OCR) to extract text and data, including from complex document layouts.

## Getting Started

To begin extracting data from your documents, simply upload your files in one of the supported formats via our API. For detailed instructions on how to upload your documents and create extraction requests, please refer to our API documentation.
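Before uploading a batch, it can help to separate files that are in a supported format from those that are not. The sketch below is one possible pre-upload check based only on file extensions. The helper name `split_by_support` is hypothetical, and the upload call itself is left out because the endpoint details live in the API documentation.

```python
from pathlib import Path
from typing import Iterable, List, Tuple

# Extensions listed on this page as supported.
SUPPORTED_EXTENSIONS = {
    ".jpeg", ".jpg", ".png", ".tiff", ".bmp",
    ".pdf", ".doc", ".docx", ".txt",
}


def split_by_support(paths: Iterable[str]) -> Tuple[List[str], List[str]]:
    """Split paths into (supported, unsupported) by file extension."""
    supported: List[str] = []
    unsupported: List[str] = []
    for path in paths:
        # Extension check is case-insensitive; files without an
        # extension are treated as unsupported.
        if Path(path).suffix.lower() in SUPPORTED_EXTENSIONS:
            supported.append(path)
        else:
            unsupported.append(path)
    return supported, unsupported
```

This check looks only at the file name, not the file contents, so a renamed file would still pass. Treat it as a convenience filter rather than validation.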

By supporting multiple document formats, Extracta LABS aims to provide a flexible solution for your document extraction needs, whether you're working with scanned material, digital documents, or editable files.

{% content-ref url="/pages/Kr8TtNYErc3zJQQ68V7K" %}
[1. Create extraction](/data-extraction-api/api-endpoints-data-extraction/1.-create-extraction.md)
{% endcontent-ref %}

