rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
Go to file
2019-06-12 22:55:18 +02:00
.vscode
ci fixes 2019-06-12 22:32:20 +02:00
exampledir tesseract single threaded 2019-06-12 17:44:47 +02:00
src better documentation 2019-06-12 22:55:18 +02:00
.gitignore
.travis.yml don't care about 32bit linux 2019-06-12 22:13:08 +02:00
Cargo.lock sqlite3 bundled 2019-06-12 22:11:20 +02:00
Cargo.toml sqlite3 bundled 2019-06-12 22:11:20 +02:00
README.md better documentation 2019-06-12 22:55:18 +02:00
rustfmt.toml

rga - ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc

rga is a tool to recursively search for text in many different types of files. It is based on the awesome ripgrep.

Linux build status Crates.io

todo

  • jpg adapter (based on object classification / detection (yolo?)) for fun
  • 7z adapter (couldn't find a nice to use rust library)

considerations

  • matching on mime (magic bytes) instead of filename
  • allow per-adapter configuration options

Setup

rga should compile with stable Rust. To install it, simply run

apt install build-essential pandoc poppler-utils
cargo install ripgrep_all

rga --help

Some rga adapters run external binaries

Development

To enable debug logging:

export RUST_LOG=debug
export RUST_BACKTRACE=1

Also rember to disable caching with --rga-no-cache or clear the cache in ~/.cache/rga to debug the adapters.

Similar tools