Best Open Source spider Libraries
A curated list of the most popular GitHub repositories tagged with spider. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
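#1 NaiboWang/EasySpider
A visual no-code web crawler/spider. EasySpider is a visual tool for browser automation testing, data collection, and crawling that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.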
#2 gocolly/colly
Elegant Scraper and Crawler Framework for Golang
#3 jhao104/proxy_pool
Python ProxyPool for web spider
#4 shengqiangzhang/examples-of-web-crawlers
Some fun, beginner-friendly Python crawler examples, mainly crawling sites such as Taobao, Tmall, WeChat, WeChat Read, Douban, and QQ.
#5 s0md3v/Photon
Incredibly fast crawler designed for OSINT.
#6 crawlab-team/crawlab
Distributed web crawler admin platform for managing spiders, regardless of language or framework.
#7 guyueyingmu/avbook
AV movie management system; crawler for avmoo, javbus, and javlibrary; online AV video library; AV magnet link database. Japanese Adult Video Library, Adult Video Magnet Links - Japanese Adult Video Database
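#8 ihmily/DouyinLiveRecorder
Live-stream recording software that supports unattended looped monitoring and recording multiple streamers at once. Records live streams from 40+ platforms, including Douyin, TikTok, YouTube, Kuaishou, Huya, Douyu, Bilibili, Xiaohongshu, pandatv, sooplive, flextv, popkontv, twitcasting, winktv, Baidu, Weibo, Kugou, 17Live, Twitch, Acfun, CHZZK, and shopee.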
#9 bda-research/node-crawler
Web Crawler/Spider for NodeJS + server-side jQuery ;-)