Skip to content

Content of new files (eg PDF) doesnt get indexed, Nextcloud AiO Docker - fresh Installation #911

@vitalymp

Description

@vitalymp

Hi,

I’m using the Docker-based Nextcloud AIO installation with Elasticsearch and the Fulltextsearch app. After the initial setup, fulltext search—including PDF content indexing—worked as expected. I was able to find both filenames and text content from within PDF files. The settings are configured to index PDF content, and the text inside the PDFs is machine-readable (not just scanned images).

However, after adding new files, I noticed that only the filenames of these new files are indexed and searchable. The actual text content from within new PDFs is not found in search results. Older files (added right after setup) are still fully searchable by content.

To summarize:

  • Initial PDF content indexing worked, including searching inside PDF files.
    
  • For newly added files, only filenames are indexed, not the content.
    
  • Settings for PDF content indexing are unchanged and correct.
    
  • PDFs are machine-readable (not just images).
    
  • No relevant errors are shown in the logs.
    

Is this a known issue? Are there any recommended troubleshooting steps or configuration checks for this situation?

upate: Fulltextsearch in TXT Files does work - not with PDF Files

Thank you!

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions