Disclaimer : Real Data API only extracts publicly available data while maintaining a strict policy against collecting any personal or identity-related information.
Crawls arbitrary websites using the Chrome browser and extracts data from pages using a provided JavaScript code. The actor supports both recursive crawling and lists of URLs and automatically manages concurrency for maximum performance. This is RealdataAPI's basic tool for web crawling and scraping.
Harness the capabilities of our Clutch.co Data Scraper to extract valuable data. Retrieve in-depth company details, numeric focus, genuine client reviews, portfolios, and more from Clutch.co's expansive commercial database. Effortlessly explore top company listings and conduct precise targeted searches.
While Clutch.co lacks a versatile or free API, our scraper serves as an unofficial Clutch API, enabling you to efficiently extract the data you require at scale. The Clutch.co Scraper offers the following features:
Clutch.co is a platform featuring in-depth client reviews, data-driven content, and vetted market leaders. Extracting and structuring this content through scraping provides invaluable business insights, offering a competitive edge.
The scraper is actively evolving, and we welcome your feature requests. Feel free to create an issue here. Anticipated updates include:
Enhancements to capture detailed information about resources.
Expanded functionality to enable searches based on keywords or specific resources.
Integration of complete reviews for a more comprehensive understanding of user feedback.
Stay tuned for these upcoming changes, which aim to further enhance the scraper's capabilities and provide a richer user experience.
Please find the updated version of the text below:
This scraper's input configuration is specified in JSON format, which includes the list of Clutch.co pages to visit. The key fields that need to be considered are:
- search (Optional) (String): A keyword for Clutch.co search, which is mandatory when the mode is specified.
- customMapFunction (Optional) (String): A function which takes every object's handle like an argument, performs functions, and returns object.
- endPage (Optional) (Number): The final page number for scraping; the default is infinite. This works with all the search requests as well as startUrls individually.
- extendOutputFunction (Optional) (String): A function that accepts a jQuery handle ($) as an argument and returns an object with data.
- includeReviews (Optional) (Boolean): An optional parameter (default: false) to add reviews into the profile objects. Set to true to scrape company reviews.
- maxItems (Optional) (Number): Limits the number of scraped items, which is useful for handling extensive lists or search results.
- mode (Optional) (String): The mode for utilizing the search keyword; the values can only be profiles and companies. This is mandatory when the search field is defined.
- proxy (Required) (Proxy Object): Configures the proxy settings, necessitating either personal proxy servers or utilizing the Real Data API Proxy.
- startUrls (Optional) (Array): A list of Clutch.co URLs; provide either a list or detailed URLs.
Please make sure that the JSON formatting is correct in order to execute the scraper with the specified input parameters smoothly.
This solution requires the use of proxy servers, either your own proxy servers or Real Data API Proxy.
To initiate scraping for a specific listing URL, simply copy and paste the link as one of the startURLs. If you intend to scrape only the first page of a list, provide the page link and set the endPage as 1. Note that enabling the includeReviews parameter generates multiple requests per company, potentially leading to higher consumption of requests or CUs. Exercise caution when setting this option to true.
The Clutch.co Scraper is finely tuned for rapid performance, prioritizing listing detail requests for optimal efficiency. Under minimal blocking conditions, it can swiftly scrape 100 listings in 2 minutes, consuming approximately 0.07-0.08 compute units. This emphasis on speed ensures efficient and effective extraction of data.
{
"startUrls": [
"https://clutch.co/profile/smartsites",
"https://clutch.co/web-developers/freelance",
"https://clutch.co/profile/blue-collar-agency"
],
"search": "api",
"mode": "companies",
"endPage": 1,
"maxItems": 50,
"includeReviews": false,
"customMapFunction": "(object) => { return object }"
}
Throughout the execution, the actor generates informative messages indicating the current page from the provided list. When items are successfully loaded from a page, corresponding messages display the count of loaded items and the total item count for that page. In the event of incorrect input, the actor promptly halts, entering a failure state, and provides an explanation of the encountered issue. This transparent communication system ensures users are well-informed about the progress and outcomes of the actor run.
Throughout the execution, the actor stores results in a dataset, with each item represented as a distinct entry. Users have the flexibility to manage these results in any programming language, whether it's Python, PHP, or Node.js/NPM. For detailed guidance on extracting results from this Clutch.co actor, refer to the FAQ or consult our API reference for comprehensive information.
The structure of all items in the Clutch.co listing look like this:
{
"url": "https://clutch.co/profile/smartsites",
"summary": {
"name": "SmartSites",
"logo": "https://img.shgstatic.com/clutchco-static/image/scale/60x60/s3fs-public/logos/a33a9494d9c0f41b112e8a1b4354a3e9.png",
"title": "Think Web. Think Smart. 💡",
"rating": 5,
"noOfReviews": 56,
"description": "Outsmart the competition with best-in-class digital marketing services. SmartSites, America's #1 rated digital marketing agency, boasts over 450 ⭐⭐⭐⭐⭐ reviews online. Call 📞 (201) 870 6000 for a free consultation! Serving businesses of all sizes, SmartSites is a Google Premier Partner and Facebook Marketing Partner. Winner of multiple website design awards and four-time Inc5000 (2017-2020) fastest growing company. Let us grow your company. Read more...",
"verificationStatus": "GOLD VERIFIED",
"minProjectSize": "$1,000+",
"averageHourlyRate": "$100 - $149 / hr",
"employees": "10 - 49",
"founded": "Founded 2011",
"addresses": [
{
"title": "headquarters",
"street": "45 Eisenhower Drive",
"locality": "Paramus",
"region": "NJ",
"postalCode": "07652",
"country": "United States",
"phone": "+1.201.870.6000"
}
]
},
"focus": [
{
"title": "Client focus",
"values": [
{
"name": "Small Business (<$10M)",
"percentage": 80
},
{
"name": "Midmarket ($10M - $1B)",
"percentage": 20
}
]
}
],
"portfolio": [
{
"image": "https://static2.clutch.co/s3fs-public/portfolio/28eae3874f8afd9e23a14f332d86353d.jpeg?N66vx9xl9cXLR5H6.euqT_BkCwjjAYD8",
"description": "Web Design, SEO, PPC"
}
],
"verification": {
"verificationStatus": "GOLD VERIFIED",
"businessEntity": {
"name": "Melen, LLC",
"status": "Active",
"jurisdictionOfFormation": "New Jersey",
"ID": "0600372607",
"source": "New Jersey Division of Revenue & Enterprise Services",
"lastUpdated": "November 1, 2020",
"dateOfFormation": "May 15, 2011"
},
"paymentLegalFilings": {
"bankruptcy": "No",
"taxLienFilings": "0",
"judgementFilings": "0",
"collectionsCount": "0",
"source": "New Jersey Division of Revenue & Enterprise Services",
"lastUpdated": "November 1, 2020",
"fullBusinessCreditReport": "https://www.smartbusinessreports.com/search2.aspx?link=1352&fn=951794204"
}
},
"reviews": [
{
"name": "SEO & PPC Services for Outdoor Refinishing Company",
"datePublished": "May 25, 2021",
"project": {
"name": "SEO & PPC Services for Outdoor Refinishing Company",
"category": "SEO & PPC",
"size": "$10,000 to $49,999",
"length": "Sep. 2020 - Jun. 2021",
"description": "SmartSites provided SEO and PPC services for an outdoor refinishing company, including adding backlinks and creating new content. They planned monthly activities to increase traffic and conversions."
},
"review": {
"rating": 5,
"quality": 5,
"schedule": 5,
"cost": 5,
"willingToRefer": 5,
"comments": "The team's work resulted in increased traffic and conversions. Thanks to SmartSites' efforts, cost per conversion (CPC) went down by 25%. They provided consistent updates and showed detailed reports about the project's status. Their timeliness and remarkable work ensured the engagement's success."
},
"reviewer": {
"title": "Owner, General Manager, Teak & Deck",
"name": "Drew Isaacman",
"industry": "Construction",
"size": "1-10 Employees",
"location": "Carlsbad, California",
"reviewType": "Online Review",
"verified": "Verified"
}
}
],
"websiteUrl": "https://www.smartsites.com/lp/digital-marketing-lp/?utm_source=clutch
}
Explore our product offerings by visiting our website. Discover a diverse range tailored to your needs. For custom integrations or specific requirements, reach out to us. We're here to assist and provide personalized solutions for you.
Check out how industries are using Clutch.co Data Scraper around the world.
E-commerce & Retail