Mosaic datasets

Mosaic datasets are used to manage, display, analyze, serve, and share imagery and raster data. When you create a new mosaic dataset, it is created as an empty container in the geodatabase with some default properties to which you can add raster data. You must have write access to that geodatabase.

A mosaic dataset consists of many parts:

  • A catalog that provides the source of the pixels and footprints of the imagery and rasters
  • A feature class that defines the boundary
  • A set of mosaicking rules that are used to dynamically mosaic the rasters
  • A set of properties used to control the mosaicking and image extraction
  • A table for logging during data loading and other operations
  • Optionally, a multidimensional table defining variables and dimensions
  • Optionally, a seamline feature class for seamline mosaicking
  • Optionally, a stereo table defining stereo models
  • Optionally, a color correction table that defines the color mapping for each raster in the table

You can create a mosaic dataset in a geodatabase and can add raster datasets to it directly, or you can create it using a selection from an existing mosaic dataset. A mosaic dataset manages its raster data in a table, where datasets are indexed and you can perform queries on the collections. There are geoprocessing tools in the Mosaic Dataset toolbox to create and edit mosaic datasets.

The imagery in a mosaic dataset does not have to be adjoining or overlapping but can exist as unconnected, discontinuous datasets. For example, you can have images that completely cover an area, or you can have many strips of images that may not join together to form a continuous image, such as along pipelines.

Contiguous data coverage
Continuous data coverage is shown.
Discontinuous data coverage
Discontinuous data coverage is shown.

To learn how to create mosaic datasets, see Creating Mosaic Datasets.

Types of mosaic datasets

There are two types of mosaic datasets: one allows you to add in your data, and the other allows you to reference another mosaic dataset.

When you create a mosaic dataset, you can add all types of image and raster data and modify the properties and functions applied per raster and on the mosaic dataset. This is created using the Create Mosaic Dataset tool. There are no limitations on this mosaic dataset.

When your mosaic dataset only references another mosaic dataset, it is created using the Create Referenced Mosaic Dataset tool. The referenced mosaic dataset behaves similarly to a regular mosaic dataset but is read-only. For example, you cannot add additional images to the mosaic dataset, you cannot build overviews, and you cannot calculate the pixel size ranges. It is used to serve conventional mosaic datasets with different mosaic dataset-level functions. For example, you can create a mosaic dataset to manage all your DEM data and create a referenced mosaic dataset to produce a hillshade or a slope product from the source mosaic dataset. Or, you can use a reference mosaic dataset to subset a mosaic dataset by defining a clip boundary or an attribute query such as a particular date or type of image. Sharing access to a referenced mosaic dataset also ensures that those accessing it cannot make modifications to the source mosaic dataset, which can impact other users.

Mosaic datasets and raster types

Mosaic datasets use raster types to read and absorb the required information from raster datasets. It identifies metadata, such as georeferencing, acquisition date, and sensor type, along with a raster format. A raster type can read raster data using a raster format, such as TIFF or JPEG. The Raster Dataset raster type reads all raster formats; however, you may want to use a specific raster type that has been created specifically to read and display the pixel data and apply the spatial reference associated with specific raster datasets. Also, raster function processing chains can be automatically applied to specific raster types to perform on the fly processing on the imagery, such as NDVI processing of imagery containing red and near infrared bands.

Many specific raster types have sophisticated capabilities and can recognize that a particular image, such as one provided by a satellite imaging company, includes a raster dataset with several bands, varying spatial resolutions, and other metadata that affect the spatial reference. Therefore, if the product has four bands of data at a 1-meter resolution and one band with a 30-centimeter resolution, the raster type creates a product that sharpens the lower-resolution data with the higher-resolution dataset (also known as pan-sharpening). Additionally, if the correct rational polynomial coefficient (RPC) information is provided, you can use this raster type to improve the fused data product by performing an orthorectification. Using the correct raster type, you can automatically define functions that are applied on the fly when the raster datasets are accessed.

Apply raster functions with mosaic datasets

Raster functions are a key component to every mosaic dataset. They allow the mosaic dataset to deliver a dynamically mosaicked image, and you can use them to enhance the mosaicked image product by applying on-the-fly processing operations such as orthorectification, image enhancements, and image algebra. You can add functions to the mosaic dataset or to individual images in the mosaic dataset, or they may be added when the data is added to the mosaic dataset. For example, when you add specific raster data products (such as from a satellite sensor) to a mosaic dataset, some functions are automatically added to the raster data. As mentioned above, you can add raster datasets that are used to generate an orthorectified, pan-sharpened image. To generate this image, both a pan-sharpen function and orthorectification function are applied to the raster data when it is accessed. This is advantageous, because it saves disk space, since you aren't required to store both source and preprocessed datasets. Additionally, if you wanted to process the same data differently, you can add the same data to a different mosaic dataset and apply different functions. You may still want to use the orthorectification function, but you may want to generate a vegetative index. For this, you can use the Band Arithmetic function or the NDVI function.

Note:

When you add Landsat 8 Surface Reflectance and ARD data to a mosaic dataset, a function is available in the raster dataset properties to add a mask. This mask allows you to mask out unwanted image data values that can affect subsequent raster functions such as negative values.

Mask function for Landsat 8 Surface Reflectance data

Mask function for Landsat 8 Surface Reflectance data is shown.

Manage multiple resolutions with mosaic datasets

Mosaic datasets are designed to handle data with varying resolutions—spectral, spatial, temporal, and radiometric. The raster types and functions in a mosaic dataset play a strong role in how all this data is handled and displayed. Additionally, the mosaic dataset is particularly aware of the spatial and temporal information as attributes of the raster data. Based on the pixel sizes, the mosaic dataset displays the imagery at the most appropriate scales. With some additional display control properties, called mosaic methods, a user can control the temporal information, allowing them to view the images for the dates they require.

You can set a mosaic method that controls what image data is presented each time a mosaic dataset is displayed, roamed or zoomed. For example, the mosaic is generated by displaying the dataset that is the closest to the center of the image. Another mosaic method allows you to define a query based on attributes such as acquisition date or cloud cover. These mosaic methods and querying capabilities allow users to access every raster dataset within the mosaic dataset, even when there is overlap.

There is no pixel data loss or metadata loss when using mosaic datasets, as the source pixels are never altered or converted, and the files are never moved; therefore, metadata files remain in their location. Because the mosaic dataset does not alter the source data or its location, the pixel values are not altered. Additionally, the mosaicking performed by the mosaic dataset occurs dynamically when the mosaic dataset is accessed. Users have access to the mosaicked image as well as the source data; therefore, there is no data loss occurring for overlapping datasets.

Overviews, which are similar to raster pyramids, can be generated for a mosaic dataset. Overviews are reduced resolution datasets that are generated to improve the speed at which the mosaic is displayed. You can allow the default overviews to be generated over the entire mosaic dataset. Alternatively, you can control how they're created by defining the downsampling ratio, the extent, a specific spatial resolution, and so on.

Advantages of mosaic datasets

Mosaic datasets are excellent data models for storing and managing data. Mosaic datasets are ideal for distributing data because they can be directly accessed by users and served. A server administrator can modify many properties of a mosaic dataset—such as the maximum image size, the level of metadata, the compression method, or the maximum number of downloads—to get the maximum performance out of their server and meet user needs. When clients connect to a server to see the mosaicked image, their application can control the same mosaic methods and other properties that a directly connected user can, along with the ability to select raster datasets and download them to their local disk. Not only does a mosaic dataset manage and visualize data, it is also a tool for disseminating imagery.

Related topics