Histograms visually summarize the distribution of a continuous numeric variable by measuring the frequency at which certain values appear in the dataset. The x-axis in a histogram is a number line that has been split into number ranges, or bins. For each bin, a bar is drawn where the width of the bar represents the range of the bin, and the height of the bar represents the number of data points that fall into that range. Understanding the distribution of your data is an important step in the data exploration process.
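The binning described above can be illustrated with a minimal sketch. It assumes equal-width bins spanning the minimum to the maximum value, and the function name `histogram_counts` is hypothetical; it is not taken from the product and may not match how the software bins values internally.

```python
def histogram_counts(values, num_bins):
    """Count how many values fall into each of num_bins equal-width bins.

    Assumption: bins are equal width and span min(values)..max(values).
    """
    lo, hi = min(values), max(values)
    width = (hi - lo) / num_bins
    counts = [0] * num_bins
    for v in values:
        if width == 0:
            # All values are identical, so everything lands in the first bin.
            idx = 0
        else:
            # The maximum value is placed in the last bin, not a new one.
            idx = min(int((v - lo) / width), num_bins - 1)
        counts[idx] += 1
    # Bin edges: num_bins + 1 boundaries along the x-axis.
    edges = [lo + i * width for i in range(num_bins + 1)]
    return edges, counts


edges, counts = histogram_counts([1, 2, 2, 3, 5, 8, 9, 10], num_bins=3)
print(edges)   # bar boundaries on the x-axis
print(counts)  # bar heights
```

Each entry in `counts` corresponds to the height of one bar, and each adjacent pair in `edges` gives that bar's range.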
Histograms require one continuous Number variable on the x-axis.
Number of bins
The number of bins defaults to the square root of the number of records in your dataset. This can be adjusted by changing the Number of bins in the Data tab of the Chart pane. Changing the number of bins allows you to see more or less detail in the structure of your data.
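As a rough illustration of the square-root default, the sketch below computes a bin count from a record count. The rounding rule and the minimum of one bin are assumptions, since the text does not say how a non-integer square root is handled, and `default_bin_count` is a hypothetical name.

```python
import math


def default_bin_count(num_records):
    """Approximate the default number of bins as sqrt(number of records).

    Assumption: the result is rounded to the nearest whole number and is
    never less than one.
    """
    return max(1, round(math.sqrt(num_records)))


print(default_bin_count(100))  # 10
print(default_bin_count(450))  # 21
```

Under this rule, a dataset of 450 records would start with about 21 bins; increasing that number shows finer structure, and decreasing it gives a smoother overview.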
Some basic descriptive statistics are calculated and displayed on histograms. The mean and median are displayed with one line each, and one standard deviation above and below the mean is displayed using two lines. You can click on these items in the chart legend to toggle them on or off.
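The positions of these four lines can be reproduced with Python's standard library, as in the sketch below. It assumes the sample standard deviation; the text does not state whether the sample or population form is used, and the function name `histogram_statistics` is hypothetical.

```python
import statistics


def histogram_statistics(values):
    """Return the x-axis positions of the statistic lines on a histogram."""
    mean = statistics.mean(values)
    # Assumption: sample standard deviation (n - 1 in the denominator).
    std = statistics.stdev(values)
    return {
        "mean": mean,
        "median": statistics.median(values),
        "mean_minus_std": mean - std,
        "mean_plus_std": mean + std,
    }


for name, position in histogram_statistics([2, 4, 4, 4, 5, 5, 7, 9]).items():
    print(f"{name}: {position:.3f}")
```

When the mean and median lines sit far apart, the distribution is likely skewed, which is often easier to spot from the lines than from the bars alone.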
You can modify the appearance of a histogram by specifying and formatting its title, description, and x-axis title.
Titles and description
Charts and axes are given default titles based on the variable names and chart type. These can be edited on the General tab in the Chart pane. You can also provide a chart Description, which is a block of text that appears at the bottom of the chart window.
It is not possible to change color settings for histograms in this release.
Create a histogram to visualize the distribution of population density across Washington, D.C. census block groups.
- Number: Population Density