AmzPy: Async-First Amazon Data Engine

“Most agencies use off-the-shelf tools that get blocked the moment you scale. We own our infrastructure. We built the libraries that handle the networking, the stealth, and the data parsing.”

🔧 Project Summary

AmzPy is a lightweight, fully async Python library tailored for extracting deep product data from Amazon. It powers the entire backend of SmartGamer.in, enabling real-time price tracking and deal discovery.

💡 Solving the PA-API Gatekeeping

The project started when Amazon denied me access to their official Product Advertising API (PA-API) for my gaming deals platform. Rather than abandoning the project, I built a scraper that was faster, more flexible, and didn’t require Amazon’s permission.

🧱 Specialized Features

Smart Variant Handling: Amazon’s Size/Color variants are notoriously difficult to scrape. AmzPy handles this by intelligently rotating through element tags to extract accurate ASIN data across different country domains.
Async-First Architecture: Built specifically to work with modern frameworks like FastAPI, allowing for hundreds of concurrent requests without blocking the event loop.
Efficiency Over Headless: By using curl_cffi instead of resource-heavy headless browsers (like Selenium or Playwright), AmzPy significantly reduces RAM costs and increases scraping speed by up to 5x.
Search & Discovery: Beyond single products, it extracts rich data from search result pages, including discount percentages, ratings, and Prime eligibility.

⚙️ The Impact

AmzPy is the sole data provider for SmartGamer.in. It processes thousands of gaming products daily, identifying “All-Time Low” price drops and trusted seller listings automatically.

When you work with us, you aren’t just getting a developer; you’re getting a battle-tested scraping engine.