The default for max-bytes (to extract) is way to large
It's not just a performance issue, much metadata also means more false positives, for example my ebooks are providing about 1MB each, I'm not sure how well is this indexed and sorted later for effective retrieval, but if it is just matching any word, accuracy will suffer at the mercy of recall.