/.well-known/

Log In Sign Up

itn.co.uk
robots.txt

Robots Exclusion Standard data for itn.co.uk

Archived Snapshots

Resource Scan

Scan Details

Site Domain	itn.co.uk
Base Domain	itn.co.uk
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-06-16T02:39:49+00:00
Next Scan	2024-09-14T02:39:49+00:00

Last Successful Scan

Scanned	2023-01-29T08:16:52+00:00
URL	https://itn.co.uk/robots.txt
Redirect	https://www.itn.co.uk/robots.txt
Redirect Domain	www.itn.co.uk
Redirect Base	itn.co.uk
Domain IPs	13.41.8.159, 18.168.233.230, 3.11.93.42
Redirect IPs	13.41.8.159, 18.168.233.230, 3.11.93.42
Response IP	13.41.8.159
Found	Yes
Hash	ec4ee7f2b92724a91baad7d22038200cffb4f755f77cb9b807ac4ae485596b3b
SimHash	b8129d0b4564

Groups

*

Rule

Path

Allow

/

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html

Back to top