Data Hub Guide
Learn how to use the Data Hub to download large datasets in bulk from the CAT Grand Database.
What is the Data Hub?
The Data Hub is a bulk download tool that lets you:
- Download emissions data for multiple counties at once
- Select multiple years and pollutants
- Export data as CSV files
- Access public CAT Grand Database data
- Get large datasets for offline analysis
This is useful for:
- Regional analysis: Downloading data for multiple counties
- Time series analysis: Getting data for multiple years
- External analysis: Exporting data for use in Excel, R, Python, etc.
- Reporting: Getting comprehensive datasets for reports
Accessing the Data Hub
- Click "DATAHUB" in the sidebar (TOOLS section)
- The Data Hub page will load with selection controls
Understanding the Interface
The Data Hub has a sidebar layout:
- Left Sidebar: Selection controls (years, pollutants, metrics, aggregation, file upload)
- Main Area: Download button and instructions
Step-by-Step: Downloading Data
Step 1: Select Years
- In the "YEAR" dropdown (left sidebar), select one or more years
- You can select multiple years (up to 60 years maximum)
- Hold Ctrl (Windows) or Cmd (Mac) to select multiple years
- Selected years will be included in your download
Step 2: Select Pollutant
- In the "POLLUTANT" dropdown, select the pollutant:
- CO2e: Carbon dioxide equivalent
- CO: Carbon monoxide
- NOx: Nitrogen oxides
- And others
- Note: You can only select one pollutant per download
Step 3: Select Metric
- In the "METRIC" dropdown, select what you want to download:
- Emissions: Emissions values
- VMT: Vehicle miles traveled
- Vehicles: Vehicle counts
- You can select one metric per download
Step 4: Select Aggregation Level
- In the "AGGREGATION" dropdown, select how data should be grouped:
- Overall: No sub-grouping
- By Vehicle Type: Group by source type
- By Fuel Type: Group by fuel type
- By Road Type: Group by road classification
- By Regclass: Group by regulatory class
Step 5: Upload Geographic Areas
- You need to provide a list of counties (geoids) to download
- Click "Download Sample File" to get a template CSV file
- Open the sample file in Excel or a text editor
- Edit the file to include the FIPS codes (geoids) you want:
- One geoid per row
- Format: 5-digit FIPS code (e.g., 36109)
- Maximum 100 geoids per download
- Save the file as CSV
- Click "Choose File" or "Browse" in the Data Hub
- Select your edited CSV file
Sample CSV Format:
geoid
36109
36023
36107
Step 6: Download Data
- Review your selections:
- Years selected
- Pollutant selected
- Metric selected
- Aggregation level
- Geographic areas file uploaded
- Click the "DOWNLOAD" button
- A loading spinner will appear: "Processing your request..."
- The system will generate your CSV file
- After processing (may take 30-60 seconds), the file will download automatically
Understanding the Limits
Input Limits
- Years: Maximum 60 years per download
- Pollutants: Maximum 1 pollutant per download
- Geographic Areas: Maximum 100 counties (geoids) per download
- File Format: CSV file with geoid column
File Size Considerations
Large downloads (many years × many counties) may:
- Take longer to process (30-60 seconds or more)
- Result in large CSV files (several MB)
- Require more time to download
Tips for Using the Data Hub
- Start Small: Test with a few years and counties first
- Use Sample File: Always use the sample file template to ensure correct format
- Check Geoid Format: Ensure FIPS codes are 5 digits (add leading zeros if needed)
- Plan Your Downloads: Consider splitting very large requests into multiple downloads
- Save Your Files: Downloaded CSV files can be large - make sure you have space
Common Tasks
Task 1: Download Single County, Multiple Years
- Select multiple years (e.g., 2020-2025)
- Select pollutant (e.g., CO2e)
- Select metric (e.g., Emissions)
- Select aggregation (e.g., Overall)
- Create CSV with one geoid
- Upload and download
Task 2: Download Multiple Counties, Single Year
- Select one year
- Select pollutant and metric
- Create CSV with multiple geoids (up to 100)
- Upload and download
- Get data for all counties in one file
Task 3: Regional Analysis
- Create CSV with all counties in your region
- Select years of interest
- Download emissions data
- Use in Excel or other tools for analysis
Understanding the Downloaded Data
CSV File Structure
The downloaded CSV will contain:
- Geographic identifiers: geoid, county name, state
- Year: The year for each record
- Pollutant: The pollutant code
- Aggregation fields: Depending on aggregation level (vehicle type, fuel type, etc.)
- Metric values: The selected metric (emissions, VMT, or vehicles)
Data Format
- Rows: Each row represents a unique combination of geoid, year, and aggregation category
- Columns: Geographic info, year, pollutant, aggregation fields, metric value
- Format: Standard CSV (comma-separated values)
Troubleshooting
"Invalid File Format" Error
If you get a file format error:
- Make sure you're using the sample file template
- Check that the file is saved as CSV (not Excel .xlsx)
- Verify the geoid column header is exactly "geoid"
- Ensure geoids are 5-digit numbers (add leading zeros if needed)
"Too Many Geoids" Error
If you exceed the 100 geoid limit:
- Split your request into multiple downloads
- Create separate CSV files with up to 100 geoids each
- Download each file separately
- Combine results in Excel or other tools if needed
Download Not Starting
If the file doesn't download:
- Check your browser's download settings
- Ensure pop-up blockers aren't preventing the download
- Wait longer (large downloads can take 60+ seconds)
- Try again if it times out
Processing Takes Too Long
If processing seems stuck:
- Very large requests (many years × many counties) can take 1-2 minutes
- Wait at least 2 minutes before assuming it failed
- Try a smaller request first to test
- Contact support if it consistently fails
Next Steps
After downloading data:
- Open in Excel: Import the CSV for analysis
- Use in R/Python: Load CSV for statistical analysis
- Create Visualizations: Use the data in your own charts
- Generate Reports: Include data in your reports
Related Topics
- Creating Orders - Create custom modeling orders
- Viewing Orders - Access your order results
- Visualizer Guide - Interactive data exploration
- Aggregator Guide - Compare counties side-by-side
Ready to compare counties? See Aggregator Guide for side-by-side comparisons.