Robots Exclusion Standard data for thejournal.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | thejournal.com |
Base Domain | thejournal.com |
Scan Status | Ok |
Last Scan | 2025-03-18T18:16:10+00:00 |
Next Scan | 2025-04-17T18:16:10+00:00 |
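The Last Scan and Next Scan timestamps imply a fixed rescan interval. A minimal sketch that computes it from the two values in this report (the variable names are illustrative, not part of the scanner):

```python
from datetime import datetime

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2025-03-18T18:16:10+00:00")
next_scan = datetime.fromisoformat("2025-04-17T18:16:10+00:00")

# Subtracting two aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 30
```

Under this reading the scanner revisits the file every 30 days, although the report does not state the schedule explicitly.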
Last Scan
Field | Value |
---|---|
Scanned | 2025-03-18T18:16:10+00:00 |
URL | https://thejournal.com/robots.txt |
Domain IPs | 172.66.40.201, 172.66.43.55, 2606:4700:3108::ac42:28c9, 2606:4700:3108::ac42:2b37 |
Response IP | 172.66.40.201 |
Found | Yes |
Hash | 2fc875c31136ba21dfb61ea9a8fa1536da4d6661102b5ce487b1f04b5c3e8fc8 |
SimHash | 30485f654fd1 |
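The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, though the report does not name the algorithm. A sketch of how such a content fingerprint could be computed, assuming SHA-256 over the raw response body. `fingerprint` is a hypothetical helper and the sample body is made up, so it does not reproduce the value above:

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body."""
    return hashlib.sha256(body).hexdigest()

# Illustrative body only; not the actual file served by thejournal.com.
sample = b"User-agent: *\nDisallow: /bin/\n"
digest = fingerprint(sample)
print(len(digest))  # 64 hex characters, the same length as the Hash field
```

Comparing the digest from one scan with the next is a cheap way to detect that the file changed at all. A SimHash, by contrast, is designed so that similar files produce similar values.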
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /WebResource.axd |
Disallow | /ScriptResource.axd |
Disallow | /App_Browsers/ |
Disallow | /App_Config/ |
Disallow | /App_Data/ |
Disallow | /aspnet_client/ |
Disallow | /bin/ |
Disallow | /bin_install/ |
Disallow | /Coremetrics/ |
Disallow | /data/ |
Disallow | /indexes/ |
Disallow | /layouts/ |
Disallow | /seotoolkit/ |
Disallow | /sitecore/ |
Disallow | /temp/ |
Disallow | /upload/ |
Disallow | /WebServices/ |
Disallow | /xsl/ |
Disallow | /AAMALL |
Disallow | /layouts/masters/system/templates/ |
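To see how a crawler might apply the rules above, here is a minimal sketch using Python's standard `urllib.robotparser`. The rule list is copied from the table. The `User-agent: *` line is reconstructed from the group name, and the bot name and test paths are hypothetical:

```python
from urllib.robotparser import RobotFileParser

# Disallow paths copied from the table above.
DISALLOWED = [
    "/WebResource.axd", "/ScriptResource.axd", "/App_Browsers/",
    "/App_Config/", "/App_Data/", "/aspnet_client/", "/bin/",
    "/bin_install/", "/Coremetrics/", "/data/", "/indexes/", "/layouts/",
    "/seotoolkit/", "/sitecore/", "/temp/", "/upload/", "/WebServices/",
    "/xsl/", "/AAMALL", "/layouts/masters/system/templates/",
]

# Rebuild an equivalent robots.txt; the user-agent line is an assumption.
lines = ["User-agent: *"] + [f"Disallow: {path}" for path in DISALLOWED]

rp = RobotFileParser()
rp.parse(lines)

def allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Check a site-relative path against the reconstructed rules."""
    return rp.can_fetch(agent, "https://thejournal.com" + path)

print(allowed("/sitecore/login"))                     # False
print(allowed("/articles/index.aspx"))                # True
print(allowed("/layouts/masters/system/templates/"))  # False
```

Because this parser matches rules by path prefix, the last rule (`/layouts/masters/system/templates/`) is already covered by the earlier `/layouts/` rule and appears to be redundant.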
Other Records
Field | Value |
---|---|
sitemap | https://thejournal.com/full/THE_Journal.xml |
sitemap | https://thejournal.com/news/THE_Journal.xml |
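The sitemap records can be read with the same standard-library parser. A short sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available) and assuming the records appear in the file as ordinary `Sitemap:` lines:

```python
from urllib.robotparser import RobotFileParser

# Sitemap records copied from the Other Records table; the exact
# line formatting in the real file is an assumption.
lines = [
    "User-agent: *",
    "Disallow: /bin/",
    "Sitemap: https://thejournal.com/full/THE_Journal.xml",
    "Sitemap: https://thejournal.com/news/THE_Journal.xml",
]

rp = RobotFileParser()
rp.parse(lines)

# site_maps() returns the listed URLs in file order, or None if absent.
sitemaps = rp.site_maps()
for url in sitemaps:
    print(url)
```

Sitemap records apply to the whole file and do not belong to any user-agent group, which is why the report lists them separately.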
Warnings
- 1 invalid line.