Methodology | Electricity Maps

Introduction

Electricity Maps' mission is to organize the world's electricity data to drive the transition towards a truly decarbonized electricity system. We provide carbon intensity data coming from electricity consumption for more than 200 zones in the world.

To do that, we follow state-of-the-art approaches to collect data from official sources. When data is missing, we use a variety of techniques to estimate it. We then run our flow-tracing pipeline to determine how electricity (and its carbon intensity) flows between countries. This allows us to provide carbon intensity information for both production and consumption for each country. Our methodology follows these steps:

Collecting data - Our data sources (how we verify and collect our data)
How missing data is handled - Our estimation models
Computing CO2 emissions - Carbon intensity and emission factors
Data processing from collection to carbon intensity calculation.
How we predict the future state of electricity grids - Our forecasts

Data sources

In order to add a new data source to our dataset, we must ensure that the data source is official and reliable. Once the data source is verified, the data itself must be validated. All data must meet a set of requirements before we incorporate it into our production systems. These requirements are described in detail below.

Where does our data come from?

Finding data sources

Before adding a new data source, we need to make sure that we are using the best and most reliable data possible. In order to achieve this goal, we need to verify that the data originates from a source that is trustworthy and of high-quality.

Our data-collection processes (called parsers) are open source and anyone is welcome to contribute to them. Our contributors must follow the guidelines regarding data credibility and quality described here. We review all contributions thoroughly before we deploy them to our production systems.

Credible data can be published by the following organizations:

Government sources: this can be an energy ministry,
Government-affiliated sources: this can be an official statistical bureau,
Transmission or Distribution System Operators,
Utility companies that generate or manage power directly.

You can also find our existing data sources documented on our GitHub page.

Verifying data sources - Technical requirements

We evaluate source quality using these criteria:

Authority: the data provider is an authority internationally/regionally, and is recognized as a source of truth (e.g. a government energy ministry).
Accuracy: the data provider is reporting factual data of high granularity (ideally hourly data).
Confirmability: the data reported by the data provider can be confirmed and verified with other credible sources.
Primary source: the data provider is as close to the energy data being collected as possible (i.e. as energy utility companies or power plant owners).
Regulation: the data provider is ideally under legal requirements regarding the provision and quality of data provided.
Rigor: the data provider presents clear explanations about data collection and processing.
Timeliness and freshness: the data provider should provide data that is updated as frequently as possible and without significant delays. Our data pipeline fetches data every 15 minutes and collects the freshest data available.
Granularity: The data granularity is hourly or higher. Depending on the source, the data granularity can be quarter-hourly or every five minutes. Our data pipelines are built to ingest data that has a granularity equal or above 1 hour.
Production mix availability: A full breakdown of the production mix is available, and the data is collected in units of power in MW. If this is not available, then specific estimation methods are used to obtain a power breakdown from the total production. The different estimation methods can be found here.
Exchanges: The net flow of cross-border exchanges is available. Exchanges are an essential part of the flow-tracing algorithm used to compute the consumption mix and carbon intensity.

Once the technical requirements have been validated by Electricity Maps for a specific country and data source, the parser is implemented so that electricity data can be collected and ingested in our pipelines.

Which types of data does Electricity Maps exclude?

Currently Electricity Maps includes data on the electricity that flows on the power grid. Therefore our production and consumption figures do not include off-grid, stand-alone production or consumption (self-consumption). For the most part this means that production and consumption data related to small-scale rooftop solar (sometimes referred to as behind-the-meter solar) is not available on Electricity Maps. We are open to including this type of data on our app. However, thus far we have not been able to find reputable, reliable data sources that meet the requirements described above.

Electricity Maps parsers

Parsers are scripts that automatically fetch data from different sources and format it in a uniform way. The formatted data is then saved to our database and processed using the flow-tracing algorithm.

Our parsers are open-source. Many of the parsers are contributions from our community. These parsers are validated and deployed after a thorough internal review. This review includes verifying the source and ensuring that we comply with the source’s terms and conditions. We also verify that each parser meets our technical requirements before adding its data to our app. We do not deploy parsers unless they have been reviewed, validated and approved internally.

The data collected by our parsers include:

Electricity production by mode,
Electricity consumption,
Electricity exchanges between two zones,
Day-ahead prices,
Production forecasts,
Consumption forecasts,
Exchange forecasts.

For more detailed information, please consult our GitHub wiki, where you can find our complete methodology and data sources.

Carbon intensity and emission factors

Both our app and our API provide information on the carbon intensity of a given zone’s electricity. Highly granular carbon intensity data is essential for most use cases, such as load-shifting and 24/7 carbon-free energy reporting. This page describes the process we use to calculate carbon intensity figures.

What is carbon intensity?

Carbon intensity measures how clean the electricity consumption is in a zone at a given time. It represents how many grams of carbon dioxide (CO2) are released in the atmosphere for each kilowatt hour (kWh) of electricity consumed.

In other words, carbon intensity represents the greenhouse gas footprint of 1 kWh consumed inside that zone. This footprint is measured in gCO2-eq (grams of CO2 equivalent), meaning that each type of greenhouse gas can be converted to its CO2 equivalent in terms of global warming potential over 100 year (for instance, 1 gram of methane emitted has the same global warming impact during 100 years as ~34 grams of CO2 over the same period). The carbon intensity of electricity generation of a zone is determined by the power production mix and their associated carbon intensity factors. There are two types of emission factors displayed on the map: default factors and regional factors.

How is carbon intensity calculated?

Lifecycle emissions for electricity generation

The carbon-intensity factors displayed on the map are emission factors computed based on the whole lifecycle of employed fuels and generation technologies. These numbers come mostly from the IPCC (2014) Fifth Assessment Report, which aggregates emission factors estimated by a multitude of peer-reviewed studies. The IPCC 2014 report is regarded as the global standard for measuring the global warming potential of different electricity sources. Emissions resulting from the extraction of resources required to build up installed capacity, emissions from direct operations, and end-of-life related emissions are all accounted for. The overview of the steps covered throughout the lifecycle is pictured in Figure 1.

‍

For each type of power plant, lifecycle emission factors account for fuel extraction and processing, construction and dismantlement of the plant, operations and anything in between.

Computing carbon intensity of electricity

To compute the CO2 emissions of the electricity consumption for a specific zone, we need to calculate how the electricity has been produced in that specific zone or country. This calculation also accounts for the electricity exchanged between countries, and how the electricity has been produced in that neighboring country. This is defined as tracing back the origin of the electricity, or flow-tracing. If you want to know more about the flow-tracing algorithm, please refer to the following article, or to the "Carbon collection and processing" tab of the methodology.

Once the origin of electricity is identified and broken down by power plant type, we can use the predefined emission factors, see below, and assess the carbon intensity of a specific zone, resulting in the values we can see in the map (Figure 3.)

*Figure 3. Carbon intensity example after implementing the flow-tracing algorithm*

Which emission factors do we use?

Emission factors on the app

The carbon-intensity factors displayed on the map are emission factors for the whole lifecycle of consumed fuels and used power plants. As explained above, these numbers come mostly from the IPCC (2014) Fifth Assessment Report, which aggregates emission factors estimated by a multitude of peer-reviewed studies.

Sometimes TSOs or energy organizations publish emissions data linked to electricity emissions. In most cases, the values published are computed using operational emission factors. These emission factors only account for emissions that directly result from the operation of a given electricity source. For example, the combustion of gas in gas power plants directly releases greenhouse gasses in the atmosphere, while the photovoltaic effect that powers solar cells does not.

These values are very different from those displayed on the Electricity Maps app. On the app, we account for all emissions generated during the construction of power plants and during the extraction of these fuels. For some commercial purposes, we also support operational emission factors in our API.

Default emission factors

The table below illustrates the emission factors used for the majority of zones displayed on the map.

Regional emission factors

For some zones, specific emission factors are available. This is the case for the US and the EU where direct power plant emissions are publicly available. It is possible to match these emissions with power plant generation data and compute per power plant emission factors, which can be aggregated at a zone level.

There are also some reputable sources, such as peer-reviewed scientific papers, or meta-analysis that offer direct emission factors with a precise geographical granularity. Using regional emission factors enables us to increase the precision of our data and to better reflect the impact of a zone’s electricity sector.

Electricity Maps’ emission factors are all publicly available here.

Low-carbon and renewable percentage

Alongside carbon intensity we provide a low-carbon percentage and renewable percentage, these are calculate using the follow equations:

Low-carbon percentage:

100 - fossil fuel percentage or
(solar + wind + geothermal + hydro + hydro discharge + nuclear + biomass + unknown* + hydro discharge** + battery discharge**) / Total * 100

Fossil fuel percentage:

100 - low-carbon percentage or
(coal + gas + oil + unknown*) / Total * 100

Renewable percentage:

(geothermal + hydro + solar + wind + hydro discharge** + battery discharge**) / Total * 100

*By default unknown is assumed to be made up of fossil fuel based modes and is excluded from low-carbon percentage and included in fossil fuel percentage, if more accurate data is available this can be overridden by a zone specific percentage calculated from the yearly average for that zone. See example here: https://docs.google.com/spreadsheets/d/1FLHQ6e9Es08BIqX654BM3SEb_fuAk4k4O-6cmRXyw_E

**hydro and battery discharge are renewable by default but can be overridden with a specific percentage on a zone by zone basis. See example here

Looking for more detailed information?

For more information, you can consult our GitHub wiki where you will find our complete methodology on emission factors and how we compute carbon intensity.

Missing Data

What happens when we have missing data? In some cases, our data parsers are not able to collect data for one or more dates. In this scenario, we estimate the missing data in order to ensure the continuity of the time series. In other instances, the source data is not complete or not granular enough. Here we use a specific model that estimates the production breakdown as accurately as possible.

Why estimate electricity production?

At Electricity Maps we aim to provide granular information about the electricity generation continuously, meaning that data is complete 100% of the time. Ensuring that the data is completes means that it can be applied to different use cases.

However, the data coming from various sources (TSOs, energy utilities, government-affiliated sources, etc.) might be unavailable due to delays in data transfer, database capacity problems, and others, leading to what we call missing data.

We take different approaches to deal with missing data; we call these approaches estimation methods. Each estimation method serves a different purpose and is related to the nature of the missing data we encounter, such as an outage of real-time data for a few hours, data reported with insufficient precision, or the unavailability of real-time data.

Short term data gaps: Time Slicer Average (TSA) Estimation Model

The first type of gap occurs when we are missing data for a short period of time, ranging usually from hours to a week. To address these gaps we implement the Time Slicer Average (TSA) estimation method. This estimation method uses available data to fill in the gap. Each missing point in the gap is filled by the average of the available data points that belong to the same time period but on different days in the given month. The estimation is then aligned in order to ensure the continuity of the estimation points with the bounds of the gap observed.

Missing real-time data: Hierarchal Estimations

In many zones and countries, real-time data is not always initially available and may be delayed by several hours. In these situations, we will employ a system of hierarchical estimation models based on external forecasts, internal forecasts, and Time Slicer Average (TSA). These models will complement each other to provide a more accurate estimate than any single model could achieve on its own.

Initially, we will use external estimates, then fall back to our internal estimates if the external ones are unavailable. If both are unavailable, we will fall back to TSA estimates.

As we implement this, the forecast data used will be limited to solar and wind energy. Estimates for other modes will rely solely on TSA-based estimates.

Only yearly electricity production available: General Purpose Zone Development (GPZD)

The GPZD estimates hourly production for various electricity sources in a zone, using yearly electricity production data and weather information (monthly and hourly). For every electricity production mode, GPZD guarantees that calculating the total electricity produced each hour will match the zone's yearly electricity production data.

Two-step process:

Capture seasonal trends: the GPZD starts with yearly electricity production data and weather information. Using specific machine learning methods per mode, it estimates monthly electricity production, ensuring the monthly estimates add up to the yearly total. This captures seasonal variations in energy generation and zone.
Breakdown to hourly data: Next, the GPZD takes the monthly estimates and breaks them down into hourly figures. This step also uses machine learning and weather data, ensuring the hourly estimates accurately sum up to the monthly total.

Result:

The GPZD provides a detailed and plausible picture of hourly electricity production, consistent with yearly reports. This model, trained on zones with both hourly and yearly data, helps us understand energy generation patterns at a granular level.

Other estimation methods

For some specific zones, we cannot implement any of the above methods. For example, we may have real-time data for a given zone that contains an erroneous breakdown for one or more of the production modes. This is the case of Switzerland (CH), where the data provided by the data source is fetched in real-time but categorizes small-scale hydropower plants under the "unknown" production mode instead of under "hydro", leading to inaccurate values for the share of electricity coming from renewable sources. In some other cases such as The Netherlands, the data provided by the live-parser data does not match official statistics.

In these cases, specific estimation methods are implemented for each particular zone, and they are detailed here.

Estimated data vs real-time data

Once the missing data points have been estimated, the time-series is considered complete and the following steps of flow-tracing, data storage and visualization can take place.

However, we always prefer real-time data to estimation methods. In cases where data was missing due to delays and the data provider has since updated their datasets with new data, the estimated data points are replaced by the real measured data. Then, the flow-tracing pipeline is triggered again, updating the carbon intensity values for the affected hours.

Looking for more detailed information?

Please check out our technical documentation, available in our GitHub repository here

Data collection and processing

Once the data is collected and stored in our database, the estimation pipeline and the flowtracing pipeline are run. The estimation pipeline ensures that our final dataset is complete. The flowtracing pipeline is used to compute the electricity consumption breakdown and the carbon intensity of a zone’s electricity consumption.

Electricity Maps data pipeline

Our data is collected for more than 200 zones from verified data sources. The data collected needs to be aligned so that the final dataset is consistent across all zones. This process is done with the estimation pipeline and the flow-tracing pipeline. The flowtracing pipeline also outputs the zones’ average carbon intensities, which can be viewed on the app and accessed on the API. The full process is shown in the figure below (Figure 1).

*Figure 1. Data collection and processing*

Data is first collected using data acquisition parsers. Then we store it in a relational databse in a parser data table before we process it. The data acquisition pipeline runs continuously to collect the most up-to-date data.

Some sources are delayed compared to others which can impact the final consumption mix. In some cases, data can be missing or needs to be estimated because of poor data quality. Before running the flow-tracing pipeline, all missing data points and specified zones are estimated using the estimation pipeline. This pipeline runs hourly for the last 3 days. Once the estimations are implemented, the flow-tracing pipeline runs for the last 3 days of data. This enables us to have the most accurate state of a grid at any given point in time, for electricity consumption and carbon intensity.

The final data is then stored in the flow-traced data table. This flow-traced data is represented on the app and on the API.

Tracing back the origin of electricity

Electricity networks

Electricity grids are networks which contain power lines and power plants spanning across multiple areas. A simplified representation consists in dividing up the electricity network in zones within which the electricity is assumed to flow freely without any restrictions. These zones can represent countries, states or even islands, depending on the data available.

Electricity can flow between these zones using interconnectors. They enable imports and exports of electricity between neighboring zones. The limited capacity of the interconnector imposes restrictions on how much electricity can flow to and from a neighboring zone.

The electricity available on a given grid is determined by the share of power supplied by each power plant connected to that grid. In the example shown below (Figure 2), if a coal power plant produces twice the power of a wind turbine, then the final mix is twice as coal-intensive as it is wind-intensive. It is important to note that in this situation, a consumer cannot choose to consume electricity generated exclusively from wind. They can only consume electricity with proportions given by the share of sources.

*Figure 2. Example of a grid with a coal power plant and a wind power plant*

‍

These characteristics have the following main consequences:

The electricity exported from a given zone is the same as the electricity available in that zone, as one can’t unmix the electricity to selectively export some elements of it.
The electricity available in a given zone is the result of mixing electricity generated from each local power plant in that zone with electricity imported from each neighboring zone.
Since the mix of electricity imported might come from a zone that also imports electricity, the mix of electricity available in a given zone is influenced by the entire chain of imports. That chain starts with neighboring zones that provide imported electricity, extends to their respective neighbors and ends when no neighbors providing imported electricity can be reached.

The flow-tracing methodology

The flow-tracing methodology is used to trace back the mix of electricity available in a given area, even in the presence of loop-flows. The methodology is a concept introduced in this peer-reviewed paper, applied on the European electricity grid in this paper, and on the US grid in this paper.

The resulting mixes in each zone depict where the electricity available in a given zone originates. Furthermore, flow-tracing shows the propagation of electricity locally generated (see figure below).

*Figure 3. Representation of the electricity flows during the first hour of January 1, 2017.*

‍

In the figure, the width of the arrows is proportional to the amount of electricity transmitted. The cascade of electricity flows from German wind and Polish coal are highlighted with blue and brown arrows, respectively. The cascade stops when the share of German wind or Polish coal becomes too small to be drawn. This happens in France for instance, as the local generation is very large compared to the imported German wind, which causes the exported electricity to contain almost no German wind.

Once the origin of electricity is identified and broken down by power plant type the carbon impact of a zone’s electricity production and consumption is computed using emission factors computed through life-cycle analysis.

Figure 4 shows the importance of taking into account the full chain of imports. The carbon intensity of the consumption mix (computed using the flow-tracing algorithm) is influenced by the electricity exchanges and can be much higher or lower than the carbon intensity of the electricity generated in the zone. For example, in Austria electricity is mainly generated from low-carbon sources and the average carbon intensity of electricity production is quite low compared to the European average. The carbon intensity of the electricity consumption mix is significantly higher because of the imports from neighboring zones such as Germany and the Czech Republic which have a more carbon intensive production mix.

*Figure 4. Comparison of average carbon intensities for the production and consumption mixes in Europe in 2022*

Looking for more detailed information?

Check out the following blog posts about the flow-tracing methodology, how our flow-tracing pipeline runs, and about production vs consumption carbon intensity.

Electricity Maps Forecasts

At Electricity Maps, we foresee a world where billions of grid-connected systems optimize when and where they consume electricity in order to reduce their cost and carbon. In this world, we want to help organizations and companies take optimal decisions when it comes to consuming electricity.

We have developed comprehensive forecasts to predict the most likely state of electricity mixes (including composite metrics such as its carbon intensity or share of renewables), interconnections, and day-ahead prices in grids worldwide, up to 72 hours ahead.

Unsure about what forecasts to use for your use case? Our blog contains valuable guides to explain how to build engaging products with our renewable energy forecasts, how co-optimizing for price and carbon can unlock a valuable business-case while ensuring reliable emission savings, or why forecasted marginal emissions are unsuited to reduce one's carbon footprint.

Here, we will explain the methodology behind our forecasting platform.

Our Inputs

We give our models a multitude of inputs. These inputs are assembled into features, which our models use to extract any relevant information that can help predict the future state of electricity grids.

Weather data is crucial to predict future evolutions of electricity grids. Colder days will for example lead to an increased demand for electricity as buildings will need additional power for heating; that in turn could result in higher electricity prices and emissions.
- The Global Forecasting System (GFS) is, for example, one of the weather inputs used by our models. GFS data is often used as a cornerstone for software relying on weather data due to its accessibility, and global coverage.
- Whatever external weather data provider Electricity Maps relies on, a processing step is applied to these forecasts to ensure standardization and completeness of these inputs, as well as aligning their spatial granularity with the rest of our data

The Electricity Maps historical and real-time database constitutes a crucial source of inputs for electricity grid data. Our forecasting models can therefore seamlessly leverage any features built from granular past and scheduled electricity production, prices, or exchanges.

All these inputs are assembled into features usable for our models, by our automated forecasting platform.

Our Models

Our models rely on two types of inputs to be able to generate forecasts for the next 72 hours of day-ahead prices, electricity mixes and carbon intensity.

Features, built as explained above, from a combination of weather and grid data inputs.
Parameters, which are defined during model training and stored in our model registry.

‍

We currently rely on tree-based methods that learn to leverage the provided features to predict, with the highest accuracy possible, day-ahead prices, electricity mixes and carbon intensity for all grids worldwide. On a high level, our models will thus use a series of sequential decision trees that iteratively split the data based on feature thresholds, with each new tree focusing on correcting the prediction errors of previous trees to continuously improve accuracy.

‍

Carbon Intensity and Renewable Share Forecasts

To predict the future state of electricity grids worldwide, we process the output of our models for electricity production and exchanges with our flow tracing algorithm. From the predicted nuclear, solar, etc power production in each zone, as well as the predicted exchanges between zones, flow tracing computes the future global origin of electricity, in each zone.

With the knowledge of the future origin of electricity at hand, it becomes possible to define the future carbon intensity, carbon-free energy percentage, and renewable energy percentage for electricity grids, globally.

Our grid forecasts are available through our API.

Day-ahead Price Forecasts

Our 72 hours day-ahead forecasts extend beyond the day-ahead market to unlock use cases requiring forecasts with a constant horizon or with a horizon beyond 24 hours.

Day-ahead electricity prices are published every day for all hours of the next day (from 00:00 to 23:59), sometimes with a sub-hourly granularity. The publication time of day-ahead prices varies based on grids. In most European countries, the day-ahead auction closes at 12:00 CET and is published within the following hour. In the USA, this is grid-dependent.

Our Data Sources

We train our forecasts to replicate the day-ahead prices, as defined at the market closure.

Europe:
- For most European countries, we rely on Nord Pool to source day-ahead prices. This multinational power exchange operator covers most of Central and Western Europe, as well as the Nordic and the Baltic countries.
- For countries not covered by Nord Pool, we source day-ahead price data directly from local market operators. That includes, for example, OMIE for Spain and Portugal, or GME in Italy.

For more details about our data sources, see the data sources tab of our methodology.

‍

Accessing our forecasts

Our day-ahead price forecasts are available through our API.

When one queries the price forecast endpoint, a response similar to the following is returned:

{
    "zone": "DE",
    "data": [
        {
            "created_at": "2025-01-28T12:01:59.749Z",
            "source": "nordpool.com",
            "price": 102.75,
            "currency": "EUR",
            "datetime": "2025-01-29T14:00:00.000Z"
        },
        {
            "created_at": "2025-01-28T12:01:59.749Z",
            "source": "nordpool.com",
            "price": 132.66,
            "currency": "EUR",
            "datetime": "2025-01-29T15:00:00.000Z"
        },
        ...,
        {
            "created_at": "2025-01-29T12:20:51.276Z",
            "source": "nordpool.com",
            "price": 120.20,
            "currency": "EUR",
            "datetime": "2025-01-30T22:00:00.000Z"
        },
        {
            "created_at": "2025-01-29T13:49:43.496Z",
            "source": "Electricity Maps Forecast",
            "price": 99.08,
            "currency": "EUR",
            "datetime": "2025-01-30T23:00:00.000Z"
        },
        ...,
        {
            "created_at": "2025-01-29T13:50:02.491Z",
            "source": "Electricity Maps Forecast",
            "price": 326.97,
            "currency": "EUR",
            "datetime": "2025-02-01T14:00:00.000Z"
        }
    ]
}

‍

Note that the source of the data changes with time. That is because, as explained by the chart below, our API reconciles the actual day-ahead prices with our forecasts. That ensures that, when available, users of the API will receive the published day-ahead prices, which are then extended up to 72 hours by our forecasts.

‍

‍

Electricity Maps Methodology

Introduction

Data sources

Where does our data come from?

Finding data sources

Verifying data sources - Technical requirements

Which types of data does Electricity Maps exclude?

Electricity Maps parsers

Carbon intensity and emission factors

What is carbon intensity?

How is carbon intensity calculated?

Lifecycle emissions for electricity generation

Computing carbon intensity of electricity

Which emission factors do we use?

Emission factors on the app

Default emission factors

Regional emission factors

Low-carbon and renewable percentage

Looking for more detailed information?

Missing Data

Why estimate electricity production?

Short term data gaps: Time Slicer Average (TSA) Estimation Model

Missing real-time data: Hierarchal Estimations

Only yearly electricity production available: General Purpose Zone Development (GPZD)

Other estimation methods

Estimated data vs real-time data

Looking for more detailed information?

Data collection and processing

Electricity Maps data pipeline

Tracing back the origin of electricity

Electricity networks

The flow-tracing methodology

Looking for more detailed information?

Electricity Maps Forecasts

Our Inputs

Our Models

Carbon Intensity and Renewable Share Forecasts

Day-ahead Price Forecasts

Our Data Sources

Accessing our forecasts

Get started today