rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
Go to file
phiresky d0a1f8962b doc
2019-06-13 15:22:33 +02:00
.vscode pass around config object 2019-06-07 19:00:24 +02:00
ci fixes 2019-06-12 22:32:20 +02:00
exampledir tesseract single threaded 2019-06-12 17:44:47 +02:00
src doc 2019-06-13 15:22:33 +02:00
.gitignore pass around config object 2019-06-07 19:00:24 +02:00
.travis.yml skip arm build for now 2019-06-12 23:05:13 +02:00
Cargo.lock sqlite3 bundled 2019-06-12 22:11:20 +02:00
Cargo.toml sqlite3 bundled 2019-06-12 22:11:20 +02:00
README.md doc 2019-06-13 15:22:33 +02:00
rustfmt.toml initial working version 2019-06-04 20:08:26 +02:00

rga - ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc

rga is a tool to recursively search for text in many different types of files. It is based on the awesome ripgrep.

Linux build status Crates.io

Future Work

  • photograph adapter (based on object classification / detection (yolo?)) for fun, based on something like this. Tried, but very hard to integrate (especially state of the art approaches).
  • 7z adapter (couldn't find a nice to use rust library)

considerations

  • matching on mime (magic bytes) instead of filename
  • allow per-adapter configuration options

Setup

rga should compile with stable Rust. To install it, simply run

apt install build-essential pandoc poppler-utils
cargo install ripgrep_all

rga --help

Some rga adapters run external binaries

Development

To enable debug logging:

export RUST_LOG=debug
export RUST_BACKTRACE=1

Also rember to disable caching with --rga-no-cache or clear the cache in ~/.cache/rga to debug the adapters.

Similar tools