dadoonet / fscrawler

Elasticsearch File System Crawler (FS Crawler)

1,435 stars
305 forks
105 issues
Java, Rich Text Format, HTML

AI Architecture Analysis

This repository is indexed by RepoMind. By analyzing dadoonet/fscrawler in our AI interface, you can instantly generate complete architecture diagrams, visualize control flows, and perform automated security audits across the entire codebase.

Our Agentic Context Augmented Generation (Agentic CAG) engine loads full source files into context on-demand, avoiding the fragmentation of traditional RAG systems. Ask questions about the architecture, dependencies, or specific features to see it in action.

To optimize performance, source files are loaded only when you start an analysis.

Embed this Badge

Showcase RepoMind's analysis directly in your repository's README.

[![Analyzed by RepoMind](https://img.shields.io/badge/Analyzed%20by-RepoMind-4F46E5?style=for-the-badge)](https://repomind.in/repo/dadoonet/fscrawler)

Repository Overview (README excerpt)

File System Crawler for Elasticsearch

Welcome to the FS Crawler for Elasticsearch.

This crawler helps to index binary documents such as PDF, Open Office, and MS Office files.

**Main features**:

• Local file system (or mounted drive) crawling: indexes new files, updates existing ones, and removes old ones.
• Remote file system crawling over SSH/FTP.
• REST interface to let you "upload" your binary documents to Elasticsearch.

Latest versions

Current "most stable" versions are:

| Elasticsearch | FS Crawler    | Released | Docs          |
|---------------|---------------|----------|---------------|
| 7.x, 8.x, 9.x | 2.10-SNAPSHOT |          | 2.10-SNAPSHOT |

Documentation

The guide has been moved to ReadTheDocs.

Contribute

Works on my machine - and yours! Spin up a pre-configured, standardized dev environment for this repository by clicking on the button below.

License

Read more about the Apache2 License.

Thanks

Thanks to JetBrains for the IntelliJ IDEA License!

Thanks to SonarCloud for the free analysis!