Supported MIME Types in the Indexing API

Overview

The Indexing API accepts a range of MIME types for the content fields of a document. This page lists commonly used MIME types that are supported, along with commonly used ones that are not.

Supported MIME Types

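The following commonly used MIME types are supported by the Indexing API. Some entries are listed by a short name (for example, pdf or msword) rather than a full type/subtype string: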

  • text/html
  • pdf
  • application/rtf
  • text/plain
  • text/rtf
  • text/directory
  • application/onenote
  • application/vnd.apple.pages
  • google-sites-page
  • flash
  • ebook
  • msword
  • application/vnd.apple.keynote
  • pptx
  • ms-excel
  • spreadsheet
  • csv
  • tsv
  • text/markdown
  • application/x-apple-diskimage
  • application/x-executable
  • application/vnd.google-apps.form

This list is not exhaustive; the Indexing API supports many other MIME types. If you have a question about a MIME type that is not listed, contact Glean support.

Unsupported MIME Types

The following commonly used MIME types are not supported by the Indexing API:

  • application/json
  • application/xml
  • text/css
  • text/xml
  • video
  • application/mp4
  • image
  • audio
  • zip
  • rar
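
As a rough pre-check before uploading, the two lists above can be encoded as lookup tables. The Python sketch below is illustrative only: the classify function and its matching rules are assumptions, not part of the Indexing API. In particular, it assumes that short entries such as pdf or image match any MIME type string that contains them, which this page does not confirm and which can over-match. Anything the sketch cannot place is reported as "unknown".

```python
# Entries copied from the lists on this page. Neither list is exhaustive.
SUPPORTED = [
    "text/html", "pdf", "application/rtf", "text/plain", "text/rtf",
    "text/directory", "application/onenote", "application/vnd.apple.pages",
    "google-sites-page", "flash", "ebook", "msword",
    "application/vnd.apple.keynote", "pptx", "ms-excel", "spreadsheet",
    "csv", "tsv", "text/markdown", "application/x-apple-diskimage",
    "application/x-executable", "application/vnd.google-apps.form",
]
UNSUPPORTED = [
    "application/json", "application/xml", "text/css", "text/xml",
    "video", "application/mp4", "image", "audio", "zip", "rar",
]


def _normalize(mime_type: str) -> str:
    # Drop parameters such as "; charset=utf-8" and compare in lowercase.
    return mime_type.split(";", 1)[0].strip().lower()


def _shorthand_hit(mime_type: str, entries: list) -> bool:
    # ASSUMPTION: an entry without "/" matches as a substring of the MIME type.
    return any(entry in mime_type for entry in entries if "/" not in entry)


def classify(mime_type: str) -> str:
    """Return "supported", "unsupported", or "unknown" (hypothetical helper)."""
    normalized = _normalize(mime_type)

    # Exact matches on full type/subtype entries take precedence, so that
    # application/x-apple-diskimage is not caught by the "image" shorthand.
    if normalized in SUPPORTED:
        return "supported"
    if normalized in UNSUPPORTED:
        return "unsupported"

    in_supported = _shorthand_hit(normalized, SUPPORTED)
    in_unsupported = _shorthand_hit(normalized, UNSUPPORTED)

    # Only report a result when exactly one list matches; otherwise stay neutral.
    if in_supported and not in_unsupported:
        return "supported"
    if in_unsupported and not in_supported:
        return "unsupported"
    return "unknown"


if __name__ == "__main__":
    for candidate in ("text/html; charset=utf-8", "application/pdf",
                      "image/png", "application/octet-stream"):
        print(candidate, "->", classify(candidate))
```

A result of "unknown" means only that the type appears in neither list on this page. Because the lists are not exhaustive, such a type may still be supported, and Glean support can confirm.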

Additional Information

For more details on how to use the Indexing API to start indexing a custom datasource, refer to the Indexing API Getting Started Guide.

If you encounter issues or have questions about supported MIME types, contact Glean support.