CreateParquetCache

Summary

Creates a cache for an Apache Parquet file.

Discussion

Before accessing a Parquet file for the first time in ArcGIS Pro, you can build the file's local cache. This is recommended for Parquet files that contain more than 500,000 rows. The number of columns in the file can also increase the amount of time it takes to create the cache. If the Parquet file contains hundreds of columns, create a cache for the file before accessing it in ArcGIS Pro, even if it contains fewer than 500,000 rows.

When accessing a file in a cloud storage location, provide the path to the cloud storage connection that accesses the Parquet file, and the name of the Parquet file.

Syntax

CreateParquetCache (parquet_file)
ParameterExplanationData Type
parquet_file

The path to the Parquet file.

String

Code sample

CreateParquetCache example 1

Build cache for a Parquet file named trafficstops.parquet that is stored on disk.

import arcpy

arcpy.da.CreateParquetCache(r'c:\data\files\trafficstops.parquet')
CreateParquetCache example 2

Build cache for a Parquet file named waypoints.parquet stored in an S3 bucket.

import arcpy

arcpy.da.CreateParquetCache(r'd:\cloudconnections\mybucket.acs\waypoints.parquet')