rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
Go to file
2019-06-12 17:44:47 +02:00
.vscode pass around config object 2019-06-07 19:00:24 +02:00
exampledir tesseract single threaded 2019-06-12 17:44:47 +02:00
src tesseract single threaded 2019-06-12 17:44:47 +02:00
.gitignore pass around config object 2019-06-07 19:00:24 +02:00
Cargo.lock add tesseract adapter 2019-06-12 17:23:30 +02:00
Cargo.toml add tesseract adapter 2019-06-12 17:23:30 +02:00
README.md add tesseract adapter 2019-06-12 17:23:30 +02:00
rustfmt.toml initial working version 2019-06-04 20:08:26 +02:00

similar:

todo

  • jpg adapter (based on object classification / detection (yolo?)) for fun
  • 7z adapter (couldn't find a nice to use rust library)

considerations

  • matching on mime (magic bytes) instead of filename
  • allow per-adapter configuration options

Development

To enable debug logging:

export RUST_LOG=debug
export RUST_BACKTRACE=1

Also rember to disable caching with --rga-no-cache or clear the cache in ~/.cache/rga to debug the adapters.