CreateParquetCache

Summary

The CreateParquetCache function creates a persistent local cache for an Apache Parquet file.

Discussion

Before accessing a Parquet file for the first time in ArcGIS Pro, you can build the file's local persistent cache. This is recommended for Parquet files that contain 500,000 or more records. The number of columns in the file can also increase the amount of time it takes to create the cache. If the Parquet file contains hundreds of columns, create a cache for the file before accessing it in ArcGIS Pro, even if it contains fewer than 500,000 records.

When accessing a file in a cloud storage location, provide the path to the cloud storage connection that accesses the Parquet file, and the name of the Parquet file.

Syntax

CreateParquetCache (parquet_file)
ParameterExplanationData Type
parquet_file

The path to the Parquet file.

String

Code sample

CreateParquetCache example 1

Build a cache for a Parquet file named trafficstops.parquet that is stored on disk.

import arcpy

arcpy.da.CreateParquetCache(r'c:\data\files\trafficstops.parquet')
CreateParquetCache example 2

Build a cache for a Parquet file named waypoints.parquet stored in an S3 bucket.

import arcpy

arcpy.da.CreateParquetCache(r'd:\cloudconnections\mybucket.acs\waypoints.parquet')