thelondoner.me
robots.txt

Robots Exclusion Standard data for thelondoner.me

Archived Snapshots

Resource Scan

Scan Details

Site Domain	thelondoner.me
Base Domain	thelondoner.me
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-05-19T08:07:30+00:00
Next Scan	2024-08-17T08:07:30+00:00

Last Successful Scan

Scanned	2023-01-05T02:51:02+00:00
URL	https://thelondoner.me/robots.txt
Domain IPs	104.26.8.130, 104.26.9.130, 172.67.70.116, 2606:4700:20::681a:882, 2606:4700:20::681a:982, 2606:4700:20::ac43:4674
Response IP	104.26.8.130
Found	Yes
Hash	47b2e95513a93aa55cdc09ed2c5cb042b9256349f15333743081f37fb1f00c82
SimHash	201cdd02a692

Groups

*

Rule	Path
Disallow	/wp-admin/
Allow	/wp-admin/admin-ajax.php

Rule

Path

Disallow

/wp-admin/

Allow

/wp-admin/admin-ajax.php

irlbot

Rule	Path
Disallow	/

Rule

Path

Disallow

sogou spider

Rule	Path
Disallow	/

Rule

Path

Disallow

sogou

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot-sa

Rule	Path
Disallow	/

Rule

Path

Disallow

seokicks-robot

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

sistrix crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

ezooms robot

Rule	Path
Disallow	/

Rule

Path

Disallow

ia_archiver

Rule	Path
Disallow	/

Rule

Path

Disallow

perl lwp

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

netestate ne crawler (+http://www.website-datenbank.de/)

Rule	Path
Disallow	/

Rule

Path

Disallow

searchmetricsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider
baiduspider-video
baiduspider-image

Rule	Path
Disallow	/

Rule

Path

Disallow

youdaobot

Rule	Path
Disallow	/

Rule

Path

Disallow

megaindex.ru/2.0

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://www.thelondoner.me/sitemap_index.xml

Field

Value

sitemap

https://www.thelondoner.me/sitemap_index.xml

thelondoner.merobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

irlbot

sogou spider

sogou

semrushbot

semrushbot-sa

seokicks-robot

blexbot

sistrix crawler

ezooms robot

ia_archiver

perl lwp

blexbot

netestate ne crawler (+http://www.website-datenbank.de/)

searchmetricsbot

baiduspiderbaiduspider-videobaiduspider-image

youdaobot

megaindex.ru/2.0

dotbot

Other Records

thelondoner.me
robots.txt

baiduspider
baiduspider-video
baiduspider-image