The content of the ARM website is available to any browser, but for the best experience we highly recommend you upgrade to a standards-compliant browser such as the latest version of Mozilla, Netscape, Internet Explorer, or Opera (all available for free download).
VIEW CART
primary link menu HOME SITE INDEX PEOPLE
skip to main content ABOUT ARMABOUT ACRFSCIENCESITESINSTRUMENTSMEASUREMENTSDATAPUBLICATIONSEDUCATIONFORMS
Manage ARM Account
Order ARM Data
Data Plots

Understanding and Finding ARM Data

(2-9-98 Raymond A. McCord, rev. 12-20-2004 Giri Palanisamy)

The Atmospheric Radiation Measurement (ARM) program has generated billions of measurements of radiation, radiative transfer, meteorology, clouds, etc. (see URL http://www.arm.gov/ for more information). These measurements have been gathered by several dozen types of instruments and are stored in millions of data files. These files may be searched or categorized by site, facility, instrument type, measurement type, and time period. The intent of this documentation is to provide an overview of the organization of ARM data to help you find data you can use. After establishing a few key words, the structure of the data will be described and a few hints for finding other information about the ARM data that is not included in the files are provided. Options for finding data are based on these structural concepts and are described in the last section. A more detailed description of requesting and accessing data files from the Archive is presented in the Overview of ARM Archive User Interface.

Key words

Understanding the data from the ARM program includes the acceptance of a few key words and concepts. The following keywords are used to describe ARM data:

Explanation of ARM data structure

This section describes ARM data as a hierarchy of information (see Figure 1). [Revision note, 2/12/2002: Although the figure illustrates data through 1998, the concepts shown in the figure are still valid.] The discussion begins with the smallest unit of information and moves up the hierarchy. Although the user should understand the smaller units of ARM data, the available quantity of data is large enough, that it is easier to consider data at the higher levels.

Measurement: the smallest portion of ARM data

The smallest unit of ARM data is called a measurement. It is a value that represents an attribute of the environment (e.g., cloud base height). These measurements are usually stored as a series of values recorded over a time sequence. Measurements include values that directly represent environmental attributes (e.g., air temperature), but they may also include values about the operating condition of the instrument (e.g., the azimuth of the sensor). Each measurement is directly related to the time of observation and integration period. In addition to 'time of observation', the minimal context required to interpret a measurement includes the definition of the location, the units recorded, and the description of the method (reference to instrument, sensor, or algorithm). A discussion of the types of ARM measurements can be found at http://www.arm.gov/data/types.stm A partial list of measurement descriptions by data source can also be found at http://www.archive.arm.gov/docs/sources/ During its history the ARM program has generated billions of measurement values.

Data file: a collection of related measurements

The ARM measurements are aggregated together in data files. A data file usually contains a time series of one or more measurements for a known time interval and a single location. Most data are stored in netCDF format. This format allows for the definition of data fields and storage of operational information in the header of the file. More information on the netCDF format and tools is available from UCAR at http://www.unidata.ucar.edu/packages/netcdf/index.html Some data files also contain one or more measurements distributed over a region or other dimension from a single time (i.e., soil water and temperature profiles, satellite images, mesonet files). Data from satellites are stored in the HDF format developed by NASA (see http://hdf.ncsa.uiuc.edu/ for more information on HDF). A single data file is the smallest unit of information distributed by ARM to its data users. Persons needing less than the contents of a single data file must derive the subset with their own tools. At this time, the ARM program has generated millions of data files.

Data stream: a series of data files

Collections of data files that have a common source and structure are called data streams. A data stream represents the output of a single data source (i.e., instrument, instrument group, or algorithm). All files within a single data stream have a common information content (e.g., same measurement types and operational information, and similar name). At this time, the ARM Program has generated more than 2000 data streams.

Data stream groups: conceptually related groups of data streams

Data streams can be categorized into more than one grouping scheme (e.g., by location, source, or processing status). Understanding relationships between data streams can simplify the overwhelming magnitude of ARM data. All of the data streams within a group may share common attributes. For example, all data streams from similar sources (instruments or algorithms) will have similar (often identical) data fields containing measurements. Data streams from similar sources will also reference common instrument descriptions.

Data streams may be "grouped" in the following categories:

Spatial Structure

Data generated by ARM typically represent a geographic point (rather than a geographic line or area). Collectively the ARM points represent a model cell of a General Circulation Model (GCM). These point locations are structured in a two-level hierarchy:

1) Sites - Southern Great Plains, Tropical Western Pacific, and North Slope of Alaska. [Revision note, 2/13/2002: The usage of virtual sites to designate data "in development" is currently inactive. Dsgp... is the only development site with stored data. Many of these data streams are being renamed to sgp... Dsgp... data streams are accessible only from the Power User option in the Browser Interface.] Developmental data have limited accessibility and users from the general scientific community are encouraged to discuss the use of this type of data with the appropriate technical contacts.

2) Facilities - similar instrument installations at different geographic locations within a site. The SGP site has central, extended, boundary, and intermediate facilities. The TWP site has installations on Manus and Nauru Island (A third site for operational staging will soon be implemented at Darwin, Australia). The NSA site includes installations at Barrow and Atqasuk.

External Data - The ARM data structure also includes similar data collected by other programs. These data frequently represent spatial areas such as satellite scenes and interpolated mesonet data. More information about the spatial extent of the external data can be obtained from URLs:

The following table describes the magnitude of the major dimensions of the ARM data structure.

Dimension Number of values Discussion
Site 3 values The major field locations operated by ARM (Southern Great Plains (SGP), North Slope of Alaska, Tropical Western Pacific)
Facility ~30 values for SGP in 5 categories Central, Extended, Boundary, Intermediate, External
Data level: ~8 values versions of the same information that reflects increments of data processing and QA review; most data streams have less than 3 data levels.
Time: (Day, month, or year ) currently spans 8-9 years and will continue several more years; many values possible depending on the time resolution chosen.
Data stream: ~2000 values can be partitioned by data levels or facility; can be partitioned by source (> 100 types of instruments and algorithms).

Supporting information external to the data files

Although the data files contain the measurement values, units, sample time, and location, many other types of information can contribute to the interpretation of the data. These include:

Options for Finding Data

ARM data can be 'found' at the Archive by using various combinations of the following approaches: