Disclaimer : Real Data API only extracts publicly available data while maintaining a strict policy against collecting any personal or identity-related information.
If you're looking for a way to scrape reviews and other important data from Google Play without taking hours, opt for Google Play Data Scraper. Scrape key details like title, app price, app ratings, download rates, screenshots, release date, version number, and developer info for all countries and languages through the Google Play Scraper API. Multi-format file exportation, including XML, JSON, CSV, Excel, and HTML, is made possible through our tool that operates almost in real-time. This service is available in a number of countries, including Australia, Canada, Germany, France, Singapore, USA, UK, UAE, and India.
Since there is Google Play official free and good API, this scraper should assist you in scraping data from Google Play.
This Google Play Data Scraper helps with the following features:
You can mail us anytime if you face any issues using this API or have any feature suggestions or requests.
You should insert the input in JSON format to this scraper with the Google Play pages list you want to visit. Check the below-required data fields.
Field | Type | Description |
---|---|---|
startUrls |
Array |
startUrls Array It is the optional input field to place the Google Play URL list. You must only give developer page, search, or application page links. |
includeReviews |
Boolean |
This input field will add each review that the source platform gives in application objects. Note that the API's resources and time will proportionally increase by the review count. |
endReviewsPage |
Integer |
endReviewsPage Integer It is the optional field to feed the final review count page you wish to scrape. |
proxy |
Object |
Proxy configuration |
Using this API would help if you used proxy servers to scrape data. You can use your proxy or try the Real Data API proxy.
If you wish to extract many application reviews, use endReviewsPage. Though this API can scrape all the reviews, the total resource consumption will increase significantly.
The API is optimized to extract more and more details quickly. Hence it forefronts each detail request. If Google Play doesn't block this API, it will scrape a hundred apps within a minute with 0.001 to 0.003 compute units.
{
"startUrls":[
"https://play.google.com/store/search?q=hello&c=apps&hl=tr&gl=US",
"https://play.google.com/store/apps/developer?id=Mattel163+Limited",
"https://play.google.com/store/apps/details?id=com.tinybuildgames.helloneighbor&hl=tr&gl=US"
],
"includeReviews":true,
"endReviewsPage":1,
"proxy":{
"useRealdataAPIProxy":true
}
}
The API will give you output messages about what's happening. Every message includes a short specifications label to inform which page the API is scraping. After loading items from the page, you must read a message about this activity with the total item count and loaded item count for every page.
If you give the wrong input to the API, it will fail to execute and give you the failure reason in the output.
While executing, the API saves the output into a dataset. There are separate columns and rows for each item in the dataset.
You can export outputs in any coding language like NPM, PHP, Python, etc.
The format of all the Google Play applications is below.
https://api.RealdataAPI.com/v2/datasets/fQ8QnvDv3BkHQfHlk/items?clean=true&format=json
You should have a Real Data API account to execute the program examples. Replace < YOUR_API_TOKEN >
in the program using the token of your actor. Read about the live APIs with Real Data API docs for more explanation.
import { RealdataAPIClient } from 'RealdataAPI-Client';
// Initialize the RealdataAPIClient with API token
const client = new RealdataAPIClient({
token: '<YOUR_API_TOKEN>',
});
// Prepare actor input
const input = {
"startUrls": [
"https://play.google.com/store/search?q=hello&c=apps&hl=tr&gl=US",
"https://play.google.com/store/apps/developer?id=Mattel163+Limited",
"https://play.google.com/store/apps/details?id=com.tinybuildgames.helloneighbor&hl=tr&gl=US"
],
"endReviewsPage": 2,
"proxy": {
"useRealdataAPIProxy": true
}
};
(async () => {
// Run the actor and wait for it to finish
const run = await client.actor("epctex/google-play-scraper").call(input);
// Fetch and print actor results from the run's dataset (if any)
console.log('Results from dataset');
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
console.dir(item);
});
})();
from RealdataAPI_client import RealdataAPIClient
# Initialize the RealdataAPIClient with your API token
client = RealdataAPIClient("<YOUR_API_TOKEN>")
# Prepare the actor input
run_input = {
"startUrls": [
"https://play.google.com/store/search?q=hello&c=apps&hl=tr&gl=US",
"https://play.google.com/store/apps/developer?id=Mattel163+Limited",
"https://play.google.com/store/apps/details?id=com.tinybuildgames.helloneighbor&hl=tr&gl=US",
],
"endReviewsPage": 2,
"proxy": { "useRealdataAPIProxy": True },
}
# Run the actor and wait for it to finish
run = client.actor("epctex/google-play-scraper").call(run_input=run_input)
# Fetch and print actor results from the run's dataset (if there are any)
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item)
# Set API token
API_TOKEN=<YOUR_API_TOKEN>
# Prepare actor input
cat > input.json <<'EOF'
{
"startUrls": [
"https://play.google.com/store/search?q=hello&c=apps&hl=tr&gl=US",
"https://play.google.com/store/apps/developer?id=Mattel163+Limited",
"https://play.google.com/store/apps/details?id=com.tinybuildgames.helloneighbor&hl=tr&gl=US"
],
"endReviewsPage": 2,
"proxy": {
"useRealdataAPIProxy": true
}
}
EOF
# Run the actor
curl "https://api.RealdataAPI.com/v2/acts/epctex~google-play-scraper/runs?token=$API_TOKEN" /
-X POST /
-d @input.json /
-H 'Content-Type: application/json'
startUrls
Optional Array
Links to begin with You should feed the list or a detailed link.
includeReviews
Optional Boolean
It contains app reviews which will accelerate resource utilization with application count proportion.
endReviewsPage
Optional Integer
This is the page you wish to end reviews with. There is no ending page by default. You can apply for reviews only and restrict their pagination.
proxy
Optional Object
Choose proxy servers so your crawler can use them.
{
"startUrls": [
"https://play.google.com/store/search?q=hello&c=apps&hl=tr&gl=US",
"https://play.google.com/store/apps/developer?id=Mattel163+Limited",
"https://play.google.com/store/apps/details?id=com.tinybuildgames.helloneighbor&hl=tr&gl=US"
],
"includeReviews": false,
"endReviewsPage": 2,
"proxy": {
"useRealdataAPIProxy": true
}
}