NumPy in ArcGIS

Numerical Python (NumPy) is a fundamental package for scientific computing in Python, including support for a powerful N-dimensional array object. NumPy allows you to perform complex mathematical operations. For more information, see the NumPy website.

A NumPy array is designed to work with large arrays. There are many existing Python functions that have been created to process NumPy arrays, the most noted being contained in the SciPy scientific computing package for Python.

Tables and feature data

You can convert table and feature classes to and from NumPy arrays using functions in the data access (arcpy.da) module.

To convert NumPy arrays to tables and feature classes, the arrays must be structured arrays. Structured arrays include fields (or structs) that are used to map the data to fields in ArcGIS table and feature classes. For more information on structured arrays, see Structured arrays.

Create a structured NumPy array.

import numpy

arr = numpy.array([(471316.383, 5000448.782), (470402.493, 5000049.216)],
                  numpy.dtype([('X', '>f8'),('Y', '>f8')]))

Once created, a structured NumPy array can be converted to a feature class or table.

Convert a NumPy array to a geodatabase feature class.

import arcpy
import numpy

out_fc = 'C:/data/texas.gdb/fd/pointlocations'

# Create a numpy array with an id field, and a field with a tuple 
#  of x,y coordinates
arr = numpy.array([(1, (471316.3835861763, 5000448.782036674)),
                   (2, (470402.49348005146, 5000049.216449278))],
                  numpy.dtype([('idfield', numpy.int32),('XY', '<f8', 2)]))

# Define a spatial reference for the output feature class
spatial_ref = arcpy.Describe('C:/data/texas.gdb/fd').spatialReference

# Export the numpy array to a feature class using the XY field to
#  represent the output point feature
arcpy.da.NumPyArrayToFeatureClass(arr, out_fc, ['XY'], spatial_ref)

arcpy.da functions for working with tables and feature data

FunctionExplanation
ExtendTable

Join the contents of a NumPy structured array to a table based on a common attribute field.

FeatureClassToNumPyArray

Convert a feature class to a NumPy structured array.

NumPyArrayToFeatureClass

Convert a NumPy structured array to a feature class.

NumPyArrayToTable

Convert a NumPy structured array to a table.

TableToNumPyArray

Convert a table to a NumPy structured array.

Integer fields in NumPy arrays do not support nulls. If data converted using FeatureClassToNumPyArray or TableToNumPyArray contains nulls, the rows containing the nulls should either be skipped entirely or masked with a substitute value.

Skip all records that include a null.

arr = arcpy.da.FeatureClassToNumPyArray(fc, fields, skip_nulls=True)

Mask each None in integer fields with different values using a dictionary.

fields = ['field1', 'field2']
arcpy.da.FeatureClassToNumPyArray(fc, fields, null_value=-9999)

Type conversions

The dtype objects of the created array are determined from the field type of the input table or feature class.

Field typeNumPy dtype

Short Integer

<i4

Long Integer

<i4

Big Integer

<i8

Float

<f4

Double

<f8

Text

<u1, <u10, and so on, depending on the length of the field

Date

<M8[s] or <M8[us] depending on the date field's precision

Date Only

<M8[D]

Time Only

<m8[us] or <m8[us]

Timestamp Offset

<M8[us]

Object ID

<i8

Guid

<u38

Global ID

<u38

Nota:

String fields converted to an array will have the same width. For instance, a string field with a width of 20 will have a dtype of <u20.

Other field types not listed above, including raster and BLOB fields, are not supported. Geometry fields are also not supported, but you can add multiple geometry properties to the array using the following tokens:

TokenDescription

SHAPE@XY

Una tupla de las coordenadas x,y del centroide de la entidad.

SHAPE@XYZ

Una tupla de las coordenadas x, y, z del centroide de la entidad.

SHAPE@TRUECENTROID

Una tupla de las coordenadas x,y del centroide de la entidad. Devuelve el mismo valor que SHAPE@XY.

SHAPE@X

Un doble de la coordenada x de la entidad.

SHAPE@Y

Un doble de la coordenada y de la entidad.

SHAPE@Z

Un doble de la coordenada z de la entidad.

SHAPE@M

Un doble del valor m de la entidad.

SHAPE@JSON

La cadena de caracteres JSON de Esri que representa la geometría.

SHAPE@WKB

Representación binaria conocida (WKB) para geometría OGC. Ofrece una representación portátil de un valor de geometría como una transmisión contigua de bytes.

SHAPE@WKT

Representación en texto conocida (WKT) para geometría OGC. Ofrece una representación portátil de un valor de geometría como cadena de caracteres.

SHAPE@AREA

Un doble del área de la entidad.

SHAPE@LENGTH

Un doble de la longitud de la entidad.

OID@

El valor del campo Id. de objeto.

Memory considerations

An array that requires more memory than is available will fail with a MemoryError exception.

The following are tips to avoid MemoryError exceptions:

  • Delete array objects after use; deleting the array will release the memory.
  • Use only those fields you need, especially text fields; a text field converted to an array will consume 4 bytes for every character of width. For instance, a string field with a width of 100 will consume 400 bytes of memory for each value in the array.
Nota:

numpy.nbytes returns a dictionary of dtypes and number of bytes.

Rasters

You can convert rasters to and from NumPy arrays using the ArcPy functions RasterToNumPyArray and NumPyArrayToRaster. You may want to convert an ArcGIS raster to a NumPy array to do the following:

  • Implement one of the many existing Python functions that can be applied to a NumPy array (for example, run filters on the data, perform multidimensional analysis, or use optimization routines).
  • Develop a custom function by accessing the individual cells within the NumPy array (for example, to implement neighborhood notation, change individual cell values, or run accumulative operators on an entire raster).

arcpy functions for working with rasters

FunctionExplanation
RasterToNumPyArray

Convert a raster to a NumPy array.

NumPyArrayToRaster

Convert a NumPy array to a raster.

A raster is converted to a NumPy array to calculate the percentage of the cell value in the entire raster row. A new raster is then created.

import arcpy
import numpy

my_array = arcpy.RasterToNumPyArray('C:/data/inRaster')
my_array_sum = my_array.sum(1)
my_array_sum.shape = (my_array.shape[0], 1)
my_array_perc = (my_array * 1.0) / my_array_sum

new_raster = arcpy.NumPyArrayToRaster(my_array_perc)
new_raster.save("C:/output/fgdb.gdb/PercentRaster")