Parquet in ArcGIS Pro

Beginning with ArcGIS Pro 3.5, there are two ways to work with Apache Parquet files. Each serves a different purpose and has different requirements:

  • Create a multifile feature connection from multiple Parquet files that have the same schema—Use this option to analyze or visualize multiple files at once. For information and requirements for this method, see Multifile feature connections and its related help pages.
  • Access a single Parquet file stored on a local drive or in an Amazon Simple Storage Service (S3) bucket and add it to a map or scene—This method is optimized for visualizing and querying features on a map or scene when the Parquet file contains a GeoParquet format spatial field. This functionality is available with any ArcGIS Pro license level. For requirements and related information for this method, see the information below.

Requirements to use the contents of a single Parquet file

To visualize the spatial data in a Parquet file that is stored on a local drive or in an Amazon S3 bucket, the file must meet the requirements described below.

File requirements

The file name and format must meet the following requirements for the file to be used with ArcGIS Pro:

  • The file extension must be .parquet.
  • Only unencrypted Parquet files are supported.
  • ArcGIS Pro supports uncompressed Parquet files or Parquet files compressed using the Snappy compression format or the GZIP compression format. Other compression formats, including custom compression formats, are not supported.
  • The file name must contain alphanumeric characters. Dashes (-) and underscores (_) are also allowed. The following special characters will be replaced with underscores in the cached layer name:
    • Parentheses (())
    • Brackets ([])
    • Curly brackets ({})
    • Ampersand (&)
    • At sign (@)
    • Caret (^)
    • Dots (.)
    • Exclamation mark (!)
    • Hash sign (#)
    • Percent sign (%)
    • Plus sign (+)
    • Single quotation mark (')
    • Tilde (~)
    • U.S. dollar sign ($)

    Other special characters are not supported.
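
The file name rules above can be checked before you add a file to a map. The following Python sketch is illustrative only and is not part of ArcGIS Pro or ArcPy; the function names are hypothetical. It assumes that the extension check is case sensitive, that the replacement applies to the file name without its .parquet extension, and that any character not listed above (including spaces) counts as unsupported. ArcGIS Pro may behave differently on these points.

```python
from pathlib import Path

# Characters that this section lists as being replaced with underscores
# in the cached layer name.
REPLACED_CHARS = set("()[]{}&@^.!#%+'~$")
# Characters that are allowed in addition to alphanumeric characters.
ALLOWED_EXTRA = set("-_")


def has_parquet_extension(file_name: str) -> bool:
    """Return True if the file extension is exactly .parquet (assumed case sensitive)."""
    return Path(file_name).suffix == ".parquet"


def unsupported_characters(file_name: str) -> list[str]:
    """List characters in the base name that are neither allowed nor replaced."""
    stem = Path(file_name).stem
    return sorted(
        {
            ch
            for ch in stem
            if not (ch.isalnum() or ch in ALLOWED_EXTRA or ch in REPLACED_CHARS)
        }
    )


def expected_layer_name(file_name: str) -> str:
    """Approximate the cached layer name by replacing listed characters with '_'."""
    stem = Path(file_name).stem
    return "".join("_" if ch in REPLACED_CHARS else ch for ch in stem)


if __name__ == "__main__":
    name = "roads(2024).v2.parquet"
    print(has_parquet_extension(name))
    print(unsupported_characters(name))
    print(expected_layer_name(name))
```

Because the actual cached layer name is generated by ArcGIS Pro, treat the result of expected_layer_name as an estimate for spotting name collisions, not as a guaranteed match.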

Column requirements

The columns in the Parquet file must meet the following requirements to be used with ArcGIS Pro:

  • Column names must contain 32 or fewer alphanumeric characters.
  • Columns cannot be nested.
  • The file must contain a single spatial (GeoParquet format) column. Fields that store x, y, and z coordinates, latitude and longitude, or elevation are not currently recognized as shape fields; they are read as string or numeric fields.
  • The spatial field must conform to version 1.0.0 or 1.1.0 of the GeoParquet standard.
  • ArcGIS Pro does not support geometry collection spatial types.
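
Some of the column requirements above can be checked from the file's metadata. GeoParquet stores its description of the spatial columns as JSON under the "geo" key of the Parquet file metadata, with "version" and "columns" entries and a "geometry_types" list for each column. The following Python sketch is illustrative only; the function name check_columns is hypothetical, and it takes the column names and the already-parsed "geo" metadata as inputs rather than reading the file. It does not detect nested columns, and it checks only the length of column names, not their characters.

```python
# GeoParquet versions that this section lists as supported.
SUPPORTED_VERSIONS = {"1.0.0", "1.1.0"}
MAX_COLUMN_NAME_LENGTH = 32


def check_columns(column_names, geo_metadata):
    """Return a list of problems found; an empty list means none were found.

    column_names: list of column names in the Parquet file.
    geo_metadata: dict parsed from the file's "geo" metadata JSON, or None
                  if the file has no GeoParquet metadata.
    """
    problems = []

    for name in column_names:
        if len(name) > MAX_COLUMN_NAME_LENGTH:
            problems.append(f"column name longer than 32 characters: {name}")

    if geo_metadata is None:
        problems.append("no GeoParquet 'geo' metadata found")
        return problems

    version = geo_metadata.get("version")
    if version not in SUPPORTED_VERSIONS:
        problems.append(f"unsupported GeoParquet version: {version}")

    geo_columns = geo_metadata.get("columns", {})
    if len(geo_columns) != 1:
        problems.append(f"expected 1 spatial column, found {len(geo_columns)}")

    for name, info in geo_columns.items():
        for geometry_type in info.get("geometry_types", []):
            # Types may carry a suffix such as " Z", so match on the prefix.
            if geometry_type.startswith("GeometryCollection"):
                problems.append(f"geometry collection in column: {name}")

    return problems


if __name__ == "__main__":
    geo = {
        "version": "1.1.0",
        "primary_column": "geometry",
        "columns": {"geometry": {"encoding": "WKB", "geometry_types": ["Polygon"]}},
    }
    print(check_columns(["id", "name", "geometry"], geo))
```

If the pyarrow package is available, one way to obtain the inputs is to read the schema with pyarrow.parquet.read_schema and parse the value stored under the b"geo" key of its metadata with json.loads.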

Supported uses in ArcGIS Pro

The following is a summary of how you can use a single Parquet file in ArcGIS Pro at this release. For more information, see FAQs about using a Parquet file in ArcGIS Pro.

  • The data from the Parquet file is intended for visualization and querying in ArcGIS Pro. Add the data to a map or scene to view it.
  • When you replace or alter the contents of the Parquet file using third-party software, ArcGIS Pro identifies that the file was modified and rebuilds the local cache of the data to incorporate the altered data. See Cached Parquet data for more information.
  • You can use the data from the Parquet file as input to a geoprocessing tool or ArcPy script, but you must first add the data to a map in ArcGIS Pro and then use the resulting map layer as the input. The tool or script runs on the map layer. You currently cannot pass the Parquet file to geoprocessing tools or ArcPy functions directly from a folder or cloud storage location.
