Skip to content

Unknown connection error when downloading from wayback #19

@thusithaC

Description

@thusithaC

Hi,

First of all thank for open sourcing this great tool.
I have been trying to learn the syntax and start playing around with archives using the "Downloading_WARC_from_Wayback" notebook using the Docker image in host network mode.

I observe some weird error messages in the process, and hope you could shed some light as to what might be happening.

[Stage 18:===============================================>     (449 + 16) / 500]java.net.ConnectException: Connection refused (Connection refused)
	at java.net.PlainSocketImpl.socketConnect(Native Method)
	at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350)
	at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206)
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188)
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
	at java.net.Socket.connect(Socket.java:589)
	at java.net.Socket.connect(Socket.java:538)
	at sun.net.NetworkClient.doConnect(NetworkClient.java:180)
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:463)
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:558)```

However, the download seems to be progressing, even though it is very slow and time consuming. (Not that I have limited the download to 500 pages). 
Is this error due to a environment setup issue? Can this be ignored?

Thanks,
TNC  

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions