thelondoner.me
robots.txt

Robots Exclusion Standard data for thelondoner.me

Resource Scan

Scan Details

Site Domain thelondoner.me
Base Domain thelondoner.me
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-05-19T08:07:30+00:00
Next Scan 2024-08-17T08:07:30+00:00

Last Successful Scan

Scanned 2023-01-05T02:51:02+00:00
URL https://thelondoner.me/robots.txt
Domain IPs 104.26.8.130, 104.26.9.130, 172.67.70.116, 2606:4700:20::681a:882, 2606:4700:20::681a:982, 2606:4700:20::ac43:4674
Response IP 104.26.8.130
Found Yes
Hash 47b2e95513a93aa55cdc09ed2c5cb042b9256349f15333743081f37fb1f00c82
SimHash 201cdd02a692

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

irlbot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

ezooms robot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

perl lwp

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

netestate ne crawler (+http://www.website-datenbank.de/)

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

dotbot

Rule Path
Disallow /
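
The groups above can be read as a lookup from user agent to path rules. The following is a minimal sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The `GROUPS` table and the `is_allowed` function are hypothetical names introduced for illustration. Only a subset of the listed groups is included, wildcards are not handled, and user agents are matched by exact lowercased name rather than by product-token matching.

```python
from typing import Dict, List, Tuple

Rule = Tuple[str, str]  # (directive, path prefix)

# Subset of the groups listed in this report, keyed by lowercased user agent.
# Hypothetical structure, for illustration only.
GROUPS: Dict[str, List[Rule]] = {
    "*": [("disallow", "/wp-admin/"), ("allow", "/wp-admin/admin-ajax.php")],
    "semrushbot": [("disallow", "/")],
    "dotbot": [("disallow", "/")],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if `path` may be fetched by `user_agent`."""
    # Fall back to the catch-all group when no named group matches.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len = -1
    allowed = True  # a path with no matching rule is allowed
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        # Longest matching prefix wins; on equal length, Allow wins.
        longer = len(prefix) > best_len
        tie_allow = len(prefix) == best_len and directive == "allow"
        if longer or tie_allow:
            best_len = len(prefix)
            allowed = directive == "allow"
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("Googlebot", "/wp-admin/"),
        ("Googlebot", "/wp-admin/admin-ajax.php"),
        ("SemrushBot", "/about/"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under this reading, the Allow rule for /wp-admin/admin-ajax.php overrides the broader /wp-admin/ Disallow for crawlers in the * group, while the named bots are blocked from every path. Parsers that apply rules in file order instead of by longest match may reach a different result for admin-ajax.php.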

Other Records

Field Value
sitemap https://www.thelondoner.me/sitemap_index.xml