/.well-known/

Log In Sign Up

topgear.com
robots.txt

Robots Exclusion Standard data for topgear.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	topgear.com
Base Domain	topgear.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-09-03T18:33:36+00:00
Next Scan	2024-12-02T18:33:36+00:00

Last Successful Scan

Scanned	2023-11-08T20:22:45+00:00
URL	https://topgear.com/robots.txt
Redirect	https://www.topgear.com/robots.txt
Redirect Domain	www.topgear.com
Redirect Base	topgear.com
Domain IPs	18.201.0.224, 52.212.159.195, 52.213.182.76
Redirect IPs	184.50.85.154, 2600:1413:b000:1c::17d1:2eda, 2600:1413:b000:1c::17d1:2ee3, 96.17.180.24
Response IP	184.50.85.154
Found	Yes
Hash	8ce312833e586293cee8cd29e8e31137e07bd4a87c5674b7ea738ee23a0b88b7
SimHash	a8109d1bc774

Groups

*

Rule

Path

Allow

/

Other Records

Field

Value

crawl-delay

10

ahrefsbot

Rule

Path

Disallow

/

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
Block specific User Agents

Back to top