back to home

sunny9577 / proxy-scraper

ā­ļø A proxy scraper made using Protractor | Proxy list Updates every three hour šŸ”„

View on GitHub
570 stars
82 forks
2 issues

AI Architecture Analysis

This repository is indexed by RepoMind. By analyzing sunny9577/proxy-scraper in our AI interface, you can instantly generate complete architecture diagrams, visualize control flows, and perform automated security audits across the entire codebase.

Our Agentic Context Augmented Generation (Agentic CAG) engine loads full source files into context on-demand, avoiding the fragmentation of traditional RAG systems. Ask questions about the architecture, dependencies, or specific features to see it in action.

Source files are only loaded when you start an analysis to optimize performance.

Embed this Badge

Showcase RepoMind's analysis directly in your repository's README.

[![Analyzed by RepoMind](https://img.shields.io/badge/Analyzed%20by-RepoMind-4F46E5?style=for-the-badge)](https://repomind.in/repo/sunny9577/proxy-scraper)
Preview:Analyzed by RepoMind

Repository Overview (README excerpt)

Crawler view

ā­ļø A proxy scraper made using Protractor | Proxy list - Updates every 3 hours šŸ”„ Current Proxy Count: 1800+ šŸš€ Quick Access - By Format TXT curl "https://sunny9577.github.io/proxy-scraper/proxies.txt" > proxies.txt JSON curl "https://sunny9577.github.io/proxy-scraper/proxies.json" > proxies.json CSV curl "https://sunny9577.github.io/proxy-scraper/proxies.csv" > proxies.csv YAML curl "https://sunny9577.github.io/proxy-scraper/proxies.yaml" > proxies.yaml XML curl "https://sunny9577.github.io/proxy-scraper/proxies.xml" > proxies.xml Quick Access - By Type Supported formats: txt | json | csv | yaml | xml SOCKS5 curl "https://sunny9577.github.io/proxy-scraper/generated/socks5_proxies.txt" > socks5_proxies.txt SOCKS4 curl "https://sunny9577.github.io/proxy-scraper/generated/socks4_proxies.txt" > socks4_proxies.txt HTTP(S) curl "https://sunny9577.github.io/proxy-scraper/generated/http_proxies.txt" > http_proxies.txt Setup Requirements 1 - A Windows, Mac or Linux Computer 2 - Google Chrome installed 3 - Nodejs and NPM installed Setup This bot is based on protractor automated browser testing software. 1 - Open a terminal and make sure you are in the project directory cd proxy-scraper 2 - Make sure you have protractor installed To check if you have protractor installed run: protractor --version If the output is something like this: Version 7.0.0 This means protractor is already installed, otherwise run the command: npm install -g protractor 3 - Install Selenium Webdriver This is where your program will run, to install it simply execute the command: webdriver-manager 4 - Database Connectivity ( Only if u want to save output to database) You can skip this step just set SAVE_TO_DB to false in config.js Go to the file config.js and change with your login information npm install mysql2 Table Structure (Table Name: proxies_tb) |Name|Type|Collation|Attributes|Null|Default|Comments|Extra| |----|----|---------|----------|----|-------|--------|-----| |1 |idPrimary |int(11) | |No |None |AUTO_INCREMENT |2 |proxyIndex |varchar(50) |latin1_swedish_ci |No |None |3 |port |int(10) | |No |None |4 |country |varchar(100) |latin1_swedish_ci |No |None |5 |type |varchar(100) |latin1_swedish_ci |No |None |6 |anonymity |varchar(100) |latin1_swedish_ci |No |None Running the bot 1 - Update the Selenium webserver and install dependencies Open a terminal and make sure you are in the project directory: cd proxy-scraper Update webdrivers: webdriver-manager update Install dependencies: npm install 2 - Run the bot Its time to start retrieving the proxies, make sure you are in the project directory: cd proxy-scraper Run the bot: protractor conf.js Additional Tools • #### socks5_test.py This python script tests the SOCKS5 proxies against https://icanhazip.com and https://eset.com. Contributed by @techroy23 Steps to run: • pip3 install tabulate pysocks • python3 socks5_test.py Disclaimer This repository is intended solely for educational purposes. I neither endorse nor encourage any illegal activities or behavior. If you are the owner of a proxy included in any of the lists and wish to have it removed, please contact us by creating an issue. License MIT