95 Commits

Author SHA1 Message Date
Bruce Tang
bf921fba04 Merge pull request #56 from ceylanb/patch-1
add hakrawler
2021-03-16 14:37:55 +08:00
Bruce Tang
f42479391f Merge pull request #65 from sandofvega/patch-1
add QueryList to PHP
2021-02-25 15:44:11 +08:00
Sand Of Vega
4a24579cdb add QueryList in PHP 2021-02-19 23:04:00 +06:00
Bruce Tang
7edc42bdbd Add spider-flow in Java section. 2021-01-26 16:07:32 +08:00
Bruce Tang
214ed7be60 Merge pull request #64 from YairHalberstadt/patch-1
Add InfinityCrawler
2020-12-28 11:04:42 +08:00
Yair Halberstadt
5066b34233 Add InfinityCrawler 2020-12-21 16:25:24 +00:00
Ceylan Bozoğullarından
e93a3884fd Hakrawler 2020-04-20 13:30:18 +03:00
Bruce Tang
5ce06579b1 Merge pull request #45 from machawk1/patch-1
Add Squidwarc
2020-04-14 15:23:05 +08:00
Bruce Tang
f60994b209 Merge pull request #54 from ehsanquddusi/patch-1
Update README.md
2020-02-18 16:39:46 +08:00
Bruce Tang
ae3dcae664 Merge pull request #55 from BruceDone/revert-50-patch-1
Revert "Added SimpleCrawler"
2020-02-15 15:54:46 +08:00
Bruce Tang
48769f6264 Revert "Added SimpleCrawler" 2020-02-15 15:54:30 +08:00
Bruce Tang
67c5735cb9 Merge pull request #50 from berti92/patch-1
Added SimpleCrawler
2020-02-15 15:53:36 +08:00
Ehsan Quddusi
48f947d9d5 Update README.md
Added crawlzone/crawlzone to PHP section
2020-01-06 00:06:41 +05:30
Andreas Treubert
14570ea628 Merge branch 'master' into patch-1 2019-10-29 13:12:07 +01:00
Bruce Tang
d008fbcbc2 Merge pull request #51 from slotix/master
Add Dataflow Kit to the list of go crawlers
2019-09-23 22:30:27 +08:00
Dmitry Narizhnykh
c03126345c Update README.md 2019-09-23 16:17:01 +02:00
Andreas Treubert
957994ae24 Update README.md 2019-09-02 00:07:01 +02:00
Mat Kelly
9b5f482e8e Add Squidwarc 2019-02-11 14:23:14 -05:00
Bruce Tang
c1afb24734 Merge pull request #35 from sebastian-nagel/master
Add CoCrawler (Python)
2019-02-10 11:06:16 +08:00
brucedone
4eab3a20e1 Add ferret to golang section 2018-10-09 11:31:03 +08:00
Bruce Tang
7158b3f278 Merge pull request #38 from howie6879/patch-1
Add an async web scraping framework.
2018-09-13 16:15:32 +08:00
howie.hu
0092b45d4d Add an async web scraping framework 2018-09-13 16:13:20 +08:00
Bruce Tang
d0818c7754 Merge pull request #29 from yujiosaka/add-headless-chrome-crawler
Add link to headless-chrome-crawler to JavaScript section.
2018-07-17 16:06:00 +08:00
Bruce Tang
906a3d4a0d Merge pull request #36 from GustavoRPS/master
Added Python Newspaper3k
2018-04-04 15:01:00 +08:00
Gustavo RPS
4cca9d7ca2 Added Python Newspaper3k
News, full-text, and article metadata extraction in Python 3
2018-04-02 10:10:31 -03:00
Sebastian Nagel
0c2dbd370f Add CoCrawler (Python) 2018-03-22 15:27:50 +01:00
Bruce Tang
5ff0642f5f Merge pull request #21 from aecio/patch-1
Added ACHE Crawler
2018-03-02 11:26:59 +08:00
Bruce Tang
c2ec9bb466 Merge pull request #33 from kaznovac/patch-1
Added spatie/crawler
2018-02-06 15:25:38 +08:00
Marko Kaznovac
1583ff2c49 Added spatie/crawler 2018-02-05 11:15:28 +01:00
Bruce Tang
0cca50eb12 Merge pull request #32 from Luxiyalu/master
Add Nokogiri to the list of Ruby crawlers.
2018-01-22 15:03:23 +08:00
Lucia Lu
6202282400 Add Nokogiri to the list of Ruby crawlers. 2018-01-21 17:32:31 -08:00
Bruce Tang
ee8affba57 Merge pull request #27 from rivermont/master
Add spidy Web Crawler.
2018-01-04 10:35:24 +08:00
Bruce Tang
6175d4442d Merge pull request #28 from yujiosaka/fix_broken_link
Fix broken link on table of contents
2017-12-12 09:58:15 +08:00
yujiosaka
2fad02f446 Add link to headless-chrome-crawler to JavaScript section 2017-12-12 05:30:39 +09:00
yujiosaka
216bb2dd28 Fix broken link on table of contents 2017-12-12 05:26:45 +09:00
Will Bennett
9f264dfdd9 Add spidy Web Crawler. 2017-12-11 08:37:18 -05:00
Bruce Tang
5d0ce784a0 Merge pull request #23 from zhuyingda/master
add a new nodejs crawler
2017-12-11 10:11:49 +08:00
Bruce Tang
cc82fc7472 Merge pull request #24 from briatte/master
add table of contents
2017-11-27 10:37:26 +08:00
François
7aff181a3b table of contents 2017-11-25 18:40:21 +01:00
朱英达
96c2b8f489 add a new nodejs crawler
add a new nodejs crawler, webster.
2017-11-24 23:43:13 +08:00
Aécio Santos
f563cf6457 Added ACHE Crawler
ACHE is a web crawler for domain-specific search.
Github: https://github.com/ViDA-NYU/ache
Documentation: http://ache.readthedocs.io
2017-10-09 13:00:33 -04:00
Bruce Tang
37279f57d2 Merge pull request #20 from asciimoo/master
Add Colly to golang section
2017-10-08 16:43:11 +08:00
Adam Tauber
8082a47dc5 add Colly to golang section 2017-10-06 13:38:33 +02:00
Bruce Tang
cbf15245cc Merge pull request #19 from tmos/patch-1
Add supercrawler
2017-07-24 15:09:20 +08:00
Tom Canac
0c024066f3 Add supercrawler 2017-07-23 15:35:16 -04:00
brucedone
72b583b51c add sukhoi to python section , fix the some small issues. 2017-07-17 15:47:55 +08:00
brucedone
a37f1ceaa4 add web-scraper-chrome-extension to JavaScript section 2017-06-22 11:01:15 +08:00
brucedone
e3824e0415 add creeper to go section 2017-06-09 11:26:46 +08:00
brucedone
4867edd660 add gain to python section 2017-06-07 10:13:56 +08:00
brucedone
379457805a add webBee to Java section 2017-04-26 10:40:45 +08:00