Machine learning and satellite imagery converge to produce the first truly global census of solar power — and the numbers challenge everything we thought we knew.
Here is a strange fact about the energy transition: until very recently, nobody actually knew how many solar panels existed on Earth. Governments tracked utility-scale farms. Utilities counted grid-connected rooftops. But the full picture — every panel on every warehouse, every off-grid installation in rural India, every floating array on a Chinese reservoir — was a blank. This dataset fills that blank. Using machine learning trained on satellite imagery, researchers identified and geolocated 68 million solar photovoltaic generating units across the globe, creating the first inventory that doesn't depend on self-reporting or administrative records alone.
The findings upended several assumptions. China's installed solar capacity was 15% larger than official statistics suggested, with thousands of small-scale installations escaping provincial reporting. India's rooftop solar sector was growing faster than any government projection had anticipated. And across sub-Saharan Africa, off-grid solar was proliferating in patterns that bore no resemblance to national energy plans — following mobile phone tower networks rather than electrical grid expansion routes. The data revealed an energy transition that was messier, faster, and more bottom-up than top-down planning frameworks assumed.
What makes this inventory particularly potent is its granularity. Each entry includes not just a location but an estimated generating capacity, enabling researchers to build capacity maps that update our understanding of global solar infrastructure. With 16,000 downloads and counting, the dataset has already been incorporated into energy models, climate projections, and policy analyses at the International Energy Agency, the World Bank, and dozens of national energy ministries. It is, in effect, the first honest photograph of the solar revolution.
Distribution of identified solar installations across global regions
Total estimated generating capacity of mapped installations over time (GW)
The data revealed an energy transition that was messier, faster, and more bottom-up than top-down planning frameworks assumed.
The first global baseline for tracking solar deployment enables year-over-year monitoring of whether the world is installing panels fast enough to meet Paris Agreement targets.
National energy statistics in 40+ countries may need revision. Policymakers relying on self-reported data have been working with incomplete pictures of their own solar infrastructure.
This dataset demonstrates that ML-driven satellite analysis can produce infrastructure inventories faster, cheaper, and more accurately than traditional survey methods — a model now being applied to wind, hydro, and grid infrastructure.
Share this story