thejournal.com
robots.txt

Robots Exclusion Standard data for thejournal.com

Resource Scan

Scan Details

Site Domain thejournal.com
Base Domain thejournal.com
Scan Status Ok
Last Scan2025-03-18T18:16:10+00:00
Next Scan 2025-04-17T18:16:10+00:00

Last Scan

Scanned2025-03-18T18:16:10+00:00
URL https://thejournal.com/robots.txt
Domain IPs 172.66.40.201, 172.66.43.55, 2606:4700:3108::ac42:28c9, 2606:4700:3108::ac42:2b37
Response IP 172.66.40.201
Found Yes
Hash 2fc875c31136ba21dfb61ea9a8fa1536da4d6661102b5ce487b1f04b5c3e8fc8
SimHash 30485f654fd1

Groups

*

Rule Path
Disallow /WebResource.axd
Disallow /ScriptResource.axd
Disallow /App_Browsers/
Disallow /App_Config/
Disallow /App_Data/
Disallow /aspnet_client/
Disallow /bin/
Disallow /bin_install/
Disallow /Coremetrics/
Disallow /data/
Disallow /indexes/
Disallow /layouts/
Disallow /seotoolkit/
Disallow /sitecore/
Disallow /temp/
Disallow /upload/
Disallow /WebServices/
Disallow /xsl/
Disallow /AAMALL
Disallow /layouts/masters/system/templates/

Other Records

Field Value
sitemap https://thejournal.com/full/THE_Journal.xml
sitemap https://thejournal.com/news/THE_Journal.xml

Warnings

  • 1 invalid line.