Skip to content

"EOFError: end of file reached" during manual chunking #63

@khaledh

Description

@khaledh

We're hitting an issue when an object is backfilled in Salesforce, causing 13M+ records to be retrieved on the next extraction (i.e. similar to a full import). The job uses manual chunking, and fails at the first step when it's querying all IDs in preparation for chunking (as opposed to querying the actual data). See the stack trace below.

From the logs it seems that the batch for fetching the IDs succeeds, but when actually fetching the CSV list of IDs, it takes about 160 seconds before failing with EOFError: end of file reached, which is caused by IOError: Connection reset by peer. It sounds like Salesforce is terminating the connection? I tried to find docs on the default request timeout when retrieving results, but couldn't find it.

The CSV response is about 340MB uncompressed, and when manually downloaded (through the Bulk Data Load Jobs UI) takes only several seconds to download. Not sure why it takes over 2.5 minutes to download via the gem.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions