logo

Google Play Scraper | Scrape Google Play Store Data

RealdataAPI / google-play-scraper

If you're looking for a way to scrape reviews and other important data from Google Play without taking hours, opt for Google Play Data Scraper. Scrape key details like title, app price, app ratings, download rates, screenshots, release date, version number, and developer info for all countries and languages through the Google Play Scraper API. Multi-format file exportation, including XML, JSON, CSV, Excel, and HTML, is made possible through our tool that operates almost in real-time. This service is available in a number of countries, including Australia, Canada, Germany, France, Singapore, USA, UK, UAE, and India.

What is Google Play Scraper, and How does it Work?

Since there is Google Play official free and good API, this scraper should assist you in scraping data from Google Play.

This Google Play Data Scraper helps with the following features:

  • Collect instant reviews - you can search for any query you want to scrape and get the output.
  • Any region or language - extract any list you want to collect from the Google Play app.
  • Search for any query - you can search for any query you want to scrape data for and get the output.
  • Get multiple applications from any developer - you can check and extract the data for the latest updates.

Updates, Bugs, Changelog, and Fixes

You can mail us anytime if you face any issues using this API or have any feature suggestions or requests.

Google Play Scraper Input Parameters

You should insert the input in JSON format to this scraper with the Google Play pages list you want to visit. Check the below-required data fields.

Field Type Description

startUrls

Array

startUrls Array It is the optional input field to place the Google Play URL list. You must only give developer page, search, or application page links.

includeReviews

Boolean

This input field will add each review that the source platform gives in application objects. Note that the API's resources and time will proportionally increase by the review count.

endReviewsPage

Integer

endReviewsPage Integer It is the optional field to feed the final review count page you wish to scrape.

proxy

Object

Proxy configuration

Using this API would help if you used proxy servers to scrape data. You can use your proxy or try the Real Data API proxy.

Advice

If you wish to extract many application reviews, use endReviewsPage. Though this API can scrape all the reviews, the total resource consumption will increase significantly.

Calculate Unit Consumption

The API is optimized to extract more and more details quickly. Hence it forefronts each detail request. If Google Play doesn't block this API, it will scrape a hundred apps within a minute with 0.001 to 0.003 compute units.

Input Example of Google Play Scraper

{
  "startUrls":[
    "https://play.google.com/store/search?q=hello&c=apps&hl=tr&gl=US",
    "https://play.google.com/store/apps/developer?id=Mattel163+Limited",
    "https://play.google.com/store/apps/details?id=com.tinybuildgames.helloneighbor&hl=tr&gl=US"
  ],
  "includeReviews":true,
  "endReviewsPage":1,
  "proxy":{
    "useRealdataAPIProxy":true
  }
}

While Executing

The API will give you output messages about what's happening. Every message includes a short specifications label to inform which page the API is scraping. After loading items from the page, you must read a message about this activity with the total item count and loaded item count for every page.

If you give the wrong input to the API, it will fail to execute and give you the failure reason in the output.

Google Play Export

While executing, the API saves the output into a dataset. There are separate columns and rows for each item in the dataset.

You can export outputs in any coding language like NPM, PHP, Python, etc.

Google Play Scraped Properties

The format of all the Google Play applications is below.

Application Detail

https://api.RealdataAPI.com/v2/datasets/fQ8QnvDv3BkHQfHlk/items?clean=true&format=json

Industries

Check out how industries use Google Play Scraper worldwide.

saas-btn.webp

E-commerce & Retail

You should have a Real Data API account to execute the program examples. Replace < YOUR_API_TOKEN > in the program using the token of your actor. Read about the live APIs with Real Data API docs for more explanation.

import { RealdataAPIClient } from 'RealdataAPI-Client';

// Initialize the RealdataAPIClient with API token
const client = new RealdataAPIClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare actor input
const input = {
    "startUrls": [
        "https://play.google.com/store/search?q=hello&c=apps&hl=tr&gl=US",
        "https://play.google.com/store/apps/developer?id=Mattel163+Limited",
        "https://play.google.com/store/apps/details?id=com.tinybuildgames.helloneighbor&hl=tr&gl=US"
    ],
    "endReviewsPage": 2,
    "proxy": {
        "useRealdataAPIProxy": true
    }
};

(async () => {
    // Run the actor and wait for it to finish
    const run = await client.actor("epctex/google-play-scraper").call(input);

    // Fetch and print actor results from the run's dataset (if any)
    console.log('Results from dataset');
    const { items } = await client.dataset(run.defaultDatasetId).listItems();
    items.forEach((item) => {
        console.dir(item);
    });
})();
from RealdataAPI_client import RealdataAPIClient

# Initialize the RealdataAPIClient with your API token
client = RealdataAPIClient("<YOUR_API_TOKEN>")

# Prepare the actor input
run_input = {
    "startUrls": [
        "https://play.google.com/store/search?q=hello&c=apps&hl=tr&gl=US",
        "https://play.google.com/store/apps/developer?id=Mattel163+Limited",
        "https://play.google.com/store/apps/details?id=com.tinybuildgames.helloneighbor&hl=tr&gl=US",
    ],
    "endReviewsPage": 2,
    "proxy": { "useRealdataAPIProxy": True },
}

# Run the actor and wait for it to finish
run = client.actor("epctex/google-play-scraper").call(run_input=run_input)

# Fetch and print actor results from the run's dataset (if there are any)
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)
# Set API token
API_TOKEN=<YOUR_API_TOKEN>

# Prepare actor input
cat > input.json <<'EOF'
{
  "startUrls": [
    "https://play.google.com/store/search?q=hello&c=apps&hl=tr&gl=US",
    "https://play.google.com/store/apps/developer?id=Mattel163+Limited",
    "https://play.google.com/store/apps/details?id=com.tinybuildgames.helloneighbor&hl=tr&gl=US"
  ],
  "endReviewsPage": 2,
  "proxy": {
    "useRealdataAPIProxy": true
  }
}
EOF

# Run the actor
curl "https://api.RealdataAPI.com/v2/acts/epctex~google-play-scraper/runs?token=$API_TOKEN" /
  -X POST /
  -d @input.json /
  -H 'Content-Type: application/json'

Start URLs

startUrls Optional Array

Links to begin with You should feed the list or a detailed link.

Include Reviews

includeReviews Optional Boolean

It contains app reviews which will accelerate resource utilization with application count proportion.

Reviews End Page

endReviewsPage Optional Integer

This is the page you wish to end reviews with. There is no ending page by default. You can apply for reviews only and restrict their pagination.

Proxy Configuration

proxy Optional Object

Choose proxy servers so your crawler can use them.

{
  "startUrls": [
    "https://play.google.com/store/search?q=hello&c=apps&hl=tr&gl=US",
    "https://play.google.com/store/apps/developer?id=Mattel163+Limited",
    "https://play.google.com/store/apps/details?id=com.tinybuildgames.helloneighbor&hl=tr&gl=US"
  ],
  "includeReviews": false,
  "endReviewsPage": 2,
  "proxy": {
    "useRealdataAPIProxy": true
  }
}