Summary
Creates a cache for an Apache Parquet file.
Discussion
Before accessing a Parquet file for the first time in ArcGIS Pro, you can build the file's local cache. This is recommended for Parquet files that contain more than 500,000 rows. The number of columns in the file can also increase the amount of time it takes to create the cache. If the Parquet file contains hundreds of columns, create a cache for the file before accessing it in ArcGIS Pro, even if it contains fewer than 500,000 rows.
When accessing a file in a cloud storage location, provide the path to the cloud storage connection that accesses the Parquet file, and the name of the Parquet file.
Syntax
CreateParquetCache (parquet_file)
Parameter | Explanation | Data Type |
parquet_file | The path to the Parquet file. | String |
Code sample
Build cache for a Parquet file named trafficstops.parquet that is stored on disk.
import arcpy
arcpy.da.CreateParquetCache(r'c:\data\files\trafficstops.parquet')
Build cache for a Parquet file named waypoints.parquet stored in an S3 bucket.
import arcpy
arcpy.da.CreateParquetCache(r'd:\cloudconnections\mybucket.acs\waypoints.parquet')