AmzPy: Async-First Amazon Data Engine
A specialized Python library for high-speed Amazon scraping, capable of handling product variants, search pages, and multi-country domains without headless browsers.
AmzPy: Async-First Amazon Data Engine
“Most agencies use off-the-shelf tools that get blocked the moment you scale. We own our infrastructure. We built the libraries that handle the networking, the stealth, and the data parsing.”
🔧 Project Summary
AmzPy is a lightweight, fully async Python library tailored for extracting deep product data from Amazon. It powers the entire backend of SmartGamer.in, enabling real-time price tracking and deal discovery.
💡 Solving the PA-API Gatekeeping
The project started when Amazon denied me access to their official Product Advertising API (PA-API) for my gaming deals platform. Rather than abandoning the project, I built a scraper that was faster, more flexible, and didn’t require Amazon’s permission.
🧱 Specialized Features
- Smart Variant Handling: Amazon’s Size/Color variants are notoriously difficult to scrape. AmzPy handles this by intelligently rotating through element tags to extract accurate ASIN data across different country domains.
- Async-First Architecture: Built specifically to work with modern frameworks like FastAPI, allowing for hundreds of concurrent requests without blocking the event loop.
- Efficiency Over Headless: By using
curl_cffiinstead of resource-heavy headless browsers (like Selenium or Playwright), AmzPy significantly reduces RAM costs and increases scraping speed by up to 5x. - Search & Discovery: Beyond single products, it extracts rich data from search result pages, including discount percentages, ratings, and Prime eligibility.
⚙️ The Impact
AmzPy is the sole data provider for SmartGamer.in. It processes thousands of gaming products daily, identifying “All-Time Low” price drops and trusted seller listings automatically.
When you work with us, you aren’t just getting a developer; you’re getting a battle-tested scraping engine.