
With meilisearch:v1.15.2 the docs scraping is failing #478

@ananta-code

Description

With meilisearch:v1.15.2, docs scraping fails with the following error:

Docs-Scraper: https://gitdocs.test.com/gitdocs/chaoseng/prod/test-report.html 72 records)
JSONDecodeError while adding documents for https://gitdocs.test.com/gitdocs/testone/prod/tech-acceptance.html: Expecting value: line 1 column 1 (char 0)
First record: {'anchor': None, 'content': 'Overview', 'type': 'content', 'tags': [], 'url': '/gitdocs/testone/prod/tech-acceptance.html', 'url_without_variables': '/gitdocs/testone/prod/tech-acceptance.html', 'page_status': None, 'url_without_anchor': '/gitdocs/testone/prod/tech-acceptance.html', 'no_variables': 'True', 'objectID': '25e3aae1a36a3bdf95e8087dde245beff6aba378', 'page_rank': 0, 'level': 0, 'position': 1, 'hierarchy_lvl0': 'Check-in Open Systems :: Architecture / Design Guidelines :: Technical Acceptance Checklist', 'hierarchy_lvl1': 'Page title', 'hierarchy_lvl2': None, 'hierarchy_lvl3': None, 'hierarchy_lvl4': None, 'hierarchy_lvl5': None, 'hierarchy_lvl6': None, 'hierarchy_radio_lvl0': None, 'hierarchy_radio_lvl1': None}

The error above shows up when I put a debugger in the add_records method in meilisearch_helper.py.

The docs-scraper version we currently use is 0.12.12.

Everything works well against a getmeili/meilisearch:v1.7.6 instance.
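For anyone trying to narrow this down: "Expecting value: line 1 column 1 (char 0)" is the message Python's json module produces when the text it is asked to parse is empty or does not start with a JSON value (for example an HTML error page). A minimal sketch of a check that could be run on the raw response text from the debugger is below. The helper name `diagnose_body` is hypothetical and not part of docs-scraper or the Meilisearch client; it only uses the standard library and assumes you can get the raw body as a string.

```python
import json


def diagnose_body(raw: str) -> str:
    """Hypothetical helper: say whether a raw response body parses as JSON."""
    try:
        json.loads(raw)
    except json.JSONDecodeError as exc:
        # Show only the first 200 characters so a long HTML error page stays readable.
        return f"not JSON ({exc}); body starts with: {raw[:200]!r}"
    return "valid JSON"


# An empty body and an HTML body both fail at the very first character.
print(diagnose_body(""))
print(diagnose_body("<html>502 Bad Gateway</html>"))
print(diagnose_body('{"taskUid": 1}'))
```

If the body turns out to be empty or HTML, the problem is more likely in what the server (or a proxy in front of it) returns than in the records themselves, but that is an assumption to verify, not a confirmed cause.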

Expected behavior
Docs scraping should complete successfully.
Current behavior
Scraping fails with the JSONDecodeError shown above.

Environment (please complete the following information):

  • OS: [e.g. Debian GNU/Linux]
  • Meilisearch version: v1.15.2
  • docs-scraper version: v0.12.12

Labels: bug, good first issue
