jw.org
robots.txt
Robots Exclusion Standard data for jw.org
Resource Scan
Scan Details
Site Domain | jw.org |
Base Domain | jw.org |
Scan Status | Ok |
Last Scan | 2024-11-02T13:53:16+00:00 |
Next Scan | 2024-11-16T13:53:16+00:00 |
Last Scan
Scanned | 2024-11-02T13:53:16+00:00 |
URL | https://jw.org/robots.txt |
Redirect | https://www.jw.org/robots.txt |
Redirect Domain | www.jw.org |
Redirect Base | jw.org |
Domain IPs | 173.222.144.81 |
Redirect IPs | 104.83.198.205 |
Response IP | 184.25.222.205 |
Found | Yes |
Hash | b2dfe8d975d692adf02d99fe924653ee3c23cf649a70e4be1613a1533926625a |
SimHash | ad35584ccd53 |
Groups
*
Rule | Path |
---|---|
Disallow | /apps/ |
Disallow | /*?*contentLanguageFilter= |
Disallow | /*?*pubFilter= |
Disallow | /*?*sortBy= |
Disallow | /*?*start=0 |
Disallow | /*?*tourl= |
Disallow | /*/choose-language |
Other Records
Field | Value |
---|---|
sitemap | https://www.jw.org/sitemap.xml |