
Robots Exclusion Standard data for thejournal.org

Resource Scan

Scan Details

Site Domain thejournal.org
Base Domain thejournal.org
Scan Status Ok
Last Scan 2026-02-07T20:42:21+00:00
Next Scan 2026-02-14T20:42:21+00:00

Last Scan

Scanned 2026-02-07T20:42:21+00:00
URL https://thejournal.org/robots.txt
Redirect https://www.thejournal.org/robots.txt
Redirect Domain www.thejournal.org
Redirect Base thejournal.org
Domain IPs 23.111.181.26
Redirect IPs 23.111.181.26
Response IP 23.111.181.26
Found Yes
Hash d94f859580ef307ff01a36bf70528ebc92c0fd7a73fc2896b08df74cad904a01
SimHash 29058908cd92

Groups

*

Rule Path
Disallow /test/
Disallow /cgi-bin/
Disallow /protect/

mediapartners-google

Rule Path
Disallow

Other Records

Field Value
sitemap http://www.thejournal.org/sitemap.xml
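
The groups and sitemap record listed above can be checked against sample URLs with Python's standard urllib.robotparser module. The sketch below is illustrative only: the ROBOTS_TXT text is a reconstruction from the rules in this report (the live file's exact wording, ordering, and comments are not shown and are assumed), and the "ExampleBot" user agent, the build_parser helper, and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the rules reported above; the real file
# served at https://www.thejournal.org/robots.txt may differ in layout.
ROBOTS_TXT = """\
User-agent: *
Disallow: /test/
Disallow: /cgi-bin/
Disallow: /protect/

User-agent: mediapartners-google
Disallow:

Sitemap: http://www.thejournal.org/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Hypothetical helper: parse robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "https://www.thejournal.org"

# A generic crawler falls under the "*" group.
print(parser.can_fetch("ExampleBot", base + "/test/page.html"))
print(parser.can_fetch("ExampleBot", base + "/articles/"))

# mediapartners-google has its own group with an empty Disallow value.
print(parser.can_fetch("Mediapartners-Google", base + "/test/page.html"))

# Sitemap records are exposed separately (Python 3.8+).
print(parser.site_maps())
```

Under this parser's handling, a generic crawler should be refused for paths beneath /test/, /cgi-bin/, and /protect/, while the empty Disallow value in the mediapartners-google group is treated as allowing everything for that agent. Other robots.txt parsers may resolve edge cases differently, so this is a quick sanity check rather than a definitive reading of the site's policy.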