Unable to resolve onion hostnames when using http proxy #443
Comments
As a possible temporary workaround, I've finally been able to get a proxychains-ng configuration working on my system. Under proxychains, wpull seems to behave properly with onion URLs. As my original intentions were to get my onion archive working under grab-site, I may look to see if this is a solution which may allow onion links to work with grab-site. Proxies seem to be broken in grab-site anyways, so it is possible proxychains works as a temporary workaround there as well.
|
grab-site currently uses the ludios_wpull fork, not this repo. I have no idea what version "2.1.15" is supposed to be; ludios_wpull is using version numbers of "3.0.x".
|
Ahh yeah that must've been a mix-up, sorry. I ran a multitude of tests yesterday so it is very possible I wasn't actually inside the container when I ran
Not realizing there was a
Using built-in proxies:
Output: http-proxy_onion_test.log
Using proxychains:
Output: proxychains_onion_test.log
Extra test (for sanity), using built-in proxies against a clearnet site:
Output: http-proxy_clearnet_test.log
I did notice that the DNS lookup addresses reported by wpull differ between the tests. As can be seen in proxychains_onion_test.log, line 282 shows the lookup address reported as the hostname provided to wpull. However, when passing http-proxies to wpull, line 285 shows the lookup address as the IP for the proxy I passed, in this case
After taking a look at the source of the generic error,
|
grab-site caught my eye, so I've begun the process of attempting to move my onion archive project over to grab-site, rather than a bunch of wget scripts. I think wpull is great, and just the replacement I was looking for, however I am having a hard time getting wpull to play nicely with my proxies and resolve onion hostnames.
Thanks to a wonderful project called multitor, I have set up an http proxy that acts as a gateway for Tor connections. This has allowed me to simply set http_proxy and https_proxy, run my wget scripts against .onion URLs, and archive them like any other website.
What I expect: Utilize the --http-proxy and --https-proxy options to set my http proxies, run my wpull scripts against .onion URLs, and archive them like any other website.
What happened: Upon running wpull against a .onion URL with http proxies passed, wpull was unable to fetch any of the requests.
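For context on the expected behavior: with an HTTP forward proxy, the client normally does not need to resolve the target hostname itself. It hands the name to the proxy, either as an absolute-form request target (plain http) or in a CONNECT request (https), and the proxy does the lookup, which is what makes .onion names reachable through a Tor gateway. The following is a minimal sketch of those two request forms, not wpull's code; whether wpull builds its requests this way is not established in this thread. The helper names and the onion hostname are hypothetical placeholders.

```python
from urllib.parse import urlsplit


def build_proxy_get(url: str) -> bytes:
    """Build a GET request as it would be sent to a forward proxy.

    The request target is the full URL (absolute-form), so the proxy,
    not the client, is responsible for resolving the hostname.
    """
    parts = urlsplit(url)
    return (
        f"GET {url} HTTP/1.1\r\n"
        f"Host: {parts.netloc}\r\n"
        "\r\n"
    ).encode("ascii")


def build_proxy_connect(url: str) -> bytes:
    """Build a CONNECT request asking the proxy to open a tunnel.

    Used for https:// targets; the hostname is again passed unresolved.
    """
    parts = urlsplit(url)
    port = parts.port or 443  # default HTTPS port when none is given
    authority = f"{parts.hostname}:{port}"
    return (
        f"CONNECT {authority} HTTP/1.1\r\n"
        f"Host: {authority}\r\n"
        "\r\n"
    ).encode("ascii")


if __name__ == "__main__":
    # Placeholder hostname for illustration only; not a real onion service.
    print(build_proxy_get("http://exampleplaceholder.onion/index.html").decode())
    print(build_proxy_connect("https://exampleplaceholder.onion/").decode())
```

In both forms the onion hostname travels to the proxy as text, so a client that tries to resolve it locally before contacting the proxy would fail on .onion names even though the proxy could have handled them.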
Wpull command:
OS: Tested across multiple environments. The first running Debian 9, and the second within the grab-site docker container.
Python version: Python 3.6.4 on Debian 9, Python 3.7.3 within the grab-site container.
Wpull version: v2.0.3 on Debian 9; v3.0.7 (corrected from v2.1.15) appears to be the version of wpull inside grab-site.
In one instance, I reverted to v1.2.3 as it seemed v2.0's network stack had "various other problems that did not exist in 1.2.3" (issue #406).
Log/Output: