theregister.com
robots.txt

Robots Exclusion Standard data for theregister.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	theregister.com
Base Domain	theregister.com
Scan Status	Ok
Last Scan	2024-10-02T20:56:48+00:00
Next Scan	2024-10-09T20:56:48+00:00

Last Scan

Scanned	2024-10-02T20:56:48+00:00
URL	https://theregister.com/robots.txt
Redirect	https://www.theregister.com/robots.txt
Redirect Domain	www.theregister.com
Redirect Base	theregister.com
Domain IPs	104.18.4.22, 104.18.5.22
Redirect IPs	104.18.4.22, 104.18.5.22
Response IP	104.18.5.22
Found	Yes
Hash	8b142f17d869e0a2b4bb583cffe2ffe673726d202dddec708a5081a7ec6e5edb
SimHash	e9285864e331

Groups

bingbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

slurp

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

magpie-crawler

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

megaindex.ru

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

re-animator

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

*

Rule	Path
Disallow	*/trackback/

Rule

Path

Disallow

*/trackback/

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

theregister.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

bingbot

Other Records

slurp

Other Records

magpie-crawler

Other Records

megaindex.ru

Other Records

re-animator

Other Records

ahrefsbot

Other Records

*

Other Records

theregister.com
robots.txt