# collector-http **Repository Path**: mirrors_andyglick/collector-http ## Basic Information - **Project Name**: collector-http - **Description**: Norconex HTTP Collector is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines. - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2020-09-24 - **Last Updated**: 2026-02-21 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README Norconex HTTP Collector ======================== ![Norconex HTTP Collector Logo](https://www.norconex.com/collectors/img/collector-http.png) Norconex HTTP Collector is a full-featured **web crawler** (or spider) that can manipulate and store collected data into a repositoriy of your choice (e.g. a search engine). It very flexible, powerful, easy to extend, and portable. Can be used command-line with file-based configuration on any OS, or can be embedded into Java applications using well documented APIs. Visit the web site for binary downloads and documentation: ### https://www.norconex.com/collectors/collector-http/