thejournal.com
robots.txt

Robots Exclusion Standard data for thejournal.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	thejournal.com
Base Domain	thejournal.com
Scan Status	Ok
Last Scan	2025-03-18T18:16:10+00:00
Next Scan	2025-04-17T18:16:10+00:00

Last Scan

Scanned	2025-03-18T18:16:10+00:00
URL	https://thejournal.com/robots.txt
Domain IPs	172.66.40.201, 172.66.43.55, 2606:4700:3108::ac42:28c9, 2606:4700:3108::ac42:2b37
Response IP	172.66.40.201
Found	Yes
Hash	2fc875c31136ba21dfb61ea9a8fa1536da4d6661102b5ce487b1f04b5c3e8fc8
SimHash	30485f654fd1

Groups

*

Rule	Path
Disallow	/WebResource.axd
Disallow	/ScriptResource.axd
Disallow	/App_Browsers/
Disallow	/App_Config/
Disallow	/App_Data/
Disallow	/aspnet_client/
Disallow	/bin/
Disallow	/bin_install/
Disallow	/Coremetrics/
Disallow	/data/
Disallow	/indexes/
Disallow	/layouts/
Disallow	/seotoolkit/
Disallow	/sitecore/
Disallow	/temp/
Disallow	/upload/
Disallow	/WebServices/
Disallow	/xsl/
Disallow	/AAMALL
Disallow	/layouts/masters/system/templates/

Rule

Path

Disallow

/WebResource.axd

Disallow

/ScriptResource.axd

Disallow

/App_Browsers/

Disallow

/App_Config/

Disallow

/App_Data/

Disallow

/aspnet_client/

Disallow

/bin/

Disallow

/bin_install/

Disallow

/Coremetrics/

Disallow

/data/

Disallow

/indexes/

Disallow

/layouts/

Disallow

/seotoolkit/

Disallow

/sitecore/

Disallow

/temp/

Disallow

/upload/

Disallow

/WebServices/

Disallow

/xsl/

Disallow

/AAMALL

Disallow

/layouts/masters/system/templates/

Back to top

Other Records

Field	Value
sitemap	https://thejournal.com/full/THE_Journal.xml
sitemap	https://thejournal.com/news/THE_Journal.xml

Field

Value

sitemap

https://thejournal.com/full/THE_Journal.xml

sitemap

https://thejournal.com/news/THE_Journal.xml

Back to top

Warnings

1 invalid line.

Back to top

thejournal.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Warnings

thejournal.com
robots.txt