In its simplest form, a raster consists of a matrix of cells (or pixels) organized into rows and columns (or a grid) where each cell contains a value representing information, such as temperature. Rasters are digital aerial photographs, imagery from satellites, digital pictures, or even scanned maps.
Types of raster data
Data stored in a raster format represents real-world phenomena:
- Continuous data represents phenomena such as elevation and spectral image data collected from satellites and airborne platforms such as drones.
- Thematic data (also known as discrete) represents features such as land use or soils data.
- Pictures include scanned maps or drawings and building photographs.
Continuous and thematic rasters may be displayed as data layers along with other geographic data on your map but are often used as the source data for spatial analysis with the ArcGIS Spatial Analyst or ArcGIS Image Analyst extension. Picture rasters are often used as attributes in tables—they can be displayed with your geographic data and are used to convey additional information about map features.
Raster data applications
While the structure of raster data is simple, it is exceptionally useful for a wide range of applications. Within a GIS, the uses of raster data fall under four main categories:
- Images as basemaps
A common use of image data in a GIS is as a background display for other feature layers. For example, orthoimages displayed underneath other GIS layers provide the map user with confidence that map layers are spatially aligned and represent real objects, as well as additional information. Three main sources of image basemaps are orthoimages from aircraft, satellite imagery, and scanned maps. Below is an orthoimage as a basemap for road data.
- Rasters as surface maps
Rasters are well suited for representing data that changes continuously across a landscape (surface). They provide an effective method of storing the continuity as a surface. They also provide a regularly spaced representation of surfaces. Elevation values measured from the earth's surface are the most common application of surface maps, but other values, such as rainfall, temperature, concentration, and population density, can also define surfaces that can be spatially analyzed. The raster below displays elevation—using green to show lower elevation and red, pink, and white cells to show higher elevations.
- Rasters as thematic maps
Rasters representing thematic data can be derived from analyzing other data. A common analysis application is classifying a satellite image into land cover categories. Basically, this activity groups the values of multispectral data into classes (such as vegetation type) and assigns a categorical value. Thematic maps can also result from geoprocessing operations that combine data from various sources, such as vector, raster, and terrain data. For example, you can process data through a geoprocessing model to create a raster dataset that maps suitability for a specific activity. Below is an example of a classified raster dataset showing land use.
- Rasters as attributes of a feature
Rasters used as attributes of a feature may be digital photographs, scanned documents, or scanned drawings related to a geographic object or location. For example, a picture of a surveyed feature may be associated with a ground control point file. A parcel layer may have scanned legal documents identifying the latest transaction for that parcel, or a layer representing cave openings may have pictures of the actual cave openings associated with the point features. Below is a digital picture of a large, old tree that could be used as an attribute to a landscape layer that a city may maintain.
Why store data as a raster?
Sometimes you don't have the choice of storing your data as a raster; for example, imagery is only available as a raster. However, there are many other features (such as points) and measurements (such as rainfall) that could be stored as either a raster or a feature (vector) data type.
The advantages of storing your data as a raster are as follows:
- A simple data structure—A matrix of cells with values representing a coordinate and sometimes linked to an attribute table
- A powerful format for advanced spatial and statistical analysis
- The ability to represent continuous surfaces and perform image and surface analysis
- The ability to uniformly store points, lines, polygons, and surfaces
- The ability to perform fast overlays with complex datasets
There are other considerations and benefits for choosing a vector-based storage option. For example:
- There can be spatial inaccuracies due to the limits imposed by the raster dataset cell dimensions.
- Raster datasets are potentially very large. Resolution increases as the size of the cell decreases; however, normally cost also increases in both disk space and processing speeds. For a given area, changing cells to one-half the current size requires as much as four times the storage space, depending on the type of data and storage techniques used.
- There may also be a loss of precision that accompanies restructuring data to a regularly spaced raster-cell boundary.
General characteristics of raster data
In raster and image datasets, each cell, or pixel, has a value. The cell values represent the phenomenon portrayed by the raster dataset such as a category, magnitude, height, or image spectral value. The category could be a land use class such as grassland, forest, or road. A magnitude might represent gravity, noise pollution, or percent rainfall. Height (distance) could represent surface elevation above mean sea level, which can be used to derive slope, aspect, and watershed properties. Spectral values are used in satellite and aerial imagery represent light reflectance and color related to plant specie and health.
Pixel values can be either positive or negative, integer, or floating point. Integer values are best used to represent categorical (discrete) data and floating-point values to represent continuous surfaces. Pixels can also have a NoData value to represent the absence of data. For information on NoData, see NoData in raster datasets.
Rasters are stored as an ordered list of cell values, for example, 80, 74, 62, 45, 45, 34, and so on.
The area (or surface) represented by each cell consists of the same width and height and is an equal portion of the entire surface represented by the raster. For example, a raster representing elevation (that is, digital elevation model) may cover an area of 100 square kilometers. If there were 100 cells in this raster, each cell would represent 1 square kilometer of equal width and height (that is, 1 km x 1 km).
The dimension of the pixels can be as large or as small as needed to represent the surface conveyed by the raster dataset and the features within the surface, such as a square kilometer, square foot, or even square centimeter. The pixel size determines how coarse or fine the patterns or features in the raster or image will appear. The smaller the pixel size, the smoother or more detailed the raster will be. However, the greater the number of pixels, the longer it will take to process, and it will increase the demand for storage space, unless you use a dynamic raster function. If a pixel size is too large, information may be lost or subtle patterns may be obscured. For example, if the pixel size is larger than your minimum mapping unit, features smaller than this size may not exist within the raster dataset. In the diagram below, you can see how this simple polygon feature will be represented by a raster dataset at various pixel sizes.
The location of each cell is defined by the row or column where it is located within the raster matrix. Essentially, the matrix is represented by a Cartesian coordinate system, in which the rows of the matrix are parallel to the x-axis and the columns to the y-axis of the Cartesian plane. This is referred to as image coordinate space. Row and column values begin with 0. In the example below, if the raster is in a Universal Transverse Mercator (UTM) projected coordinate system and has a cell size of 100, the cell location at 5,1 would be 300,500 East, 5,900,600 North.
Learn more about how to georeference a raster
Often you need to specify the extent of a raster. The extent is defined by the top, bottom, left, and right coordinates of the rectangular area covered by a raster, as shown below.