Download List

프로젝트 설명

Ebot is a scalable and distribuited Web crawler. The URLs are saved to a NOSQL database (which supports map/reduce queries) that you can query via RESTful HTTP requests or using your preferred programming languages. The URLs that need to be analyzed are sent to AMQP queues. In this way, it is possible to run several crawlers in parallel and stop and start them without losing URLs.

System Requirements

System requirement is not defined
Information regarding Project Releases and Project Resources. Note that the information here is a quote from Freecode.com page, and the downloads themselves may not be hosted on OSDN.

2010-12-30 13:03
0.9

A는 더 나은 아키텍처를 플러그인. 이미지 URL과 HTML 페이지의 제목을 절약을위한 새로운 플러그인이 없습니다.
A better plugin architecture. New plugins for saving image URLs and titles of HTML pages.

2010-12-18 18:59
0.8

이 릴리스는 rabbitmq (2.2.0)와 couchbeam의 최신 릴리스로 업데이 트되었습니다. 일부 버그가 수정되었습니다.
This release was updated to the latest releases of rabbitmq (2.2.0) and couchbeam. Some bugs were fixed.

2010-09-18 07:05
0.7

Erlang R14A와의 호환성. 데비안 테스팅 테스트.
Compatibility with Erlang R14A. Tested with Debian Testing.

2010-08-27 07:24
0.6

최신 (개발)와 호환성이 필요한 라이브러리 및 소프트웨어 (rabbitmq, couchdb, webmachine, riak 등) 출시
Compatibility with the latest (development) releases of required libraries and software (rabbitmq, couchdb, webmachine, riak, etc.)

2010-06-20 23:18
0.5

더 나은 확장성 들어, 신규 amqp 대기열이 (.* .* .* .* ebot.refused)와 크롤 러 (ebot_web의 옛 코어) ebot.completed 지금은 두 개의 서로 다른 병렬 모듈에서 분할되어 ebot.fetched ebot.new를 사용하는 / 프로세스 (ebot_html과 ebot_web).
For better scalability, new amqp queues are used (ebot.new.* ebot.fetched.* ebot.completed.* ebot.refused.*) and the old core of the crawler (ebot_web) is now split in two different and parallel modules/processes (ebot_html and ebot_web).

Project Resources