huckleberry.xenite.org
robots.txt

Robots Exclusion Standard data for huckleberry.xenite.org

Resource Scan

Scan Details

Site Domain huckleberry.xenite.org
Base Domain xenite.org
Scan Status Ok
Last Scan 2024-10-03T01:34:31+00:00
Next Scan 2024-10-10T01:34:31+00:00

Last Scan

Scanned 2024-10-03T01:34:31+00:00
URL https://huckleberry.xenite.org/robots.txt
Domain IPs 192.96.218.79
Response IP 192.96.218.79
Found Yes
Hash b03c7c3462b38be13ecdc6bdb113f45fec74f314d31c8006d403eea775d44f54
SimHash 5b2edbd16455
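
The Hash value above is 64 hexadecimal digits long, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the sketch below is only an assumption: it treats the value as SHA-256 over the raw response body and checks a locally saved copy against it. The function name `matches_reported_hash` is hypothetical.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "b03c7c3462b38be13ecdc6bdb113f45fec74f314d31c8006d403eea775d44f54"


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Return True if the SHA-256 digest of `body` equals `reported`.

    Assumption: the scanner hashes the raw robots.txt bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest() == reported.lower()


if __name__ == "__main__":
    # An empty body is certainly not the scanned file.
    print(matches_reported_hash(b""))
```

A mismatch does not prove the file changed; it may only mean the scanner hashes something different, for example normalized text.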

Groups

baiduspider
bingbot
duckduckbot
exabot
facebookexternalhit
feedfetcher-google
google-inspectiontool
google-site-verification
google-speakr
googlebot
googlebot-image
googlebot-news
googlebot-video
mediapartners-google
msnbot
qwantify
speedyspider
twitterbot
yandexantivirus/2.0
yandexbot/3.0
yandeximageresizer/2.0
yandeximages/3.0
yandexmedia/3.0
yandexpagechecker/1.0
yandexwebmaster/2.0
yandexzakladki/3.0

Rule Path
Disallow /?s
Disallow *pw_post_layout/*
Disallow /cgi-bin/
Disallow /go/
Disallow /link/
Disallow /show/
Disallow /visit/
Disallow /wp-admin/

ahrefsbot
ccbot/2.0
chatgpt-user
google-cloudvertexbot
google-extended
googleother
mj12bot
mojeekbot
petalbot
semrushbot
similarweb
yepbot
youbot
*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://huckleberry.xenite.org/huckleberry-sitemap.txt
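
The two groups and the sitemap record above can be checked with Python's standard `urllib.robotparser`. The following is a minimal sketch, not the site's actual file: it rebuilds only a subset of the listed user agents (two from each group), and the `/about/` path is a hypothetical example URL. The helper names `build_parser` and `is_allowed` are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the report above: only a few user agents
# from each group are included, and the original layout is an assumption.
ROBOTS_TXT = """\
User-agent: googlebot
User-agent: bingbot
Disallow: /?s
Disallow: *pw_post_layout/*
Disallow: /cgi-bin/
Disallow: /go/
Disallow: /link/
Disallow: /show/
Disallow: /visit/
Disallow: /wp-admin/

User-agent: ahrefsbot
User-agent: *
Disallow: /

Sitemap: https://huckleberry.xenite.org/huckleberry-sitemap.txt
"""

BASE = "https://huckleberry.xenite.org"


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` on the scanned host."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    rp = build_parser()
    # "/about/" is a hypothetical path used only for illustration.
    for agent in ("googlebot", "ahrefsbot", "somebot"):
        for path in ("/about/", "/wp-admin/options.php"):
            print(agent, path, is_allowed(rp, agent, path))
    print(rp.site_maps())
```

The sketch exercises only the plain prefix rules. The `*pw_post_layout/*` rule relies on wildcard matching, which this standard-library parser is not expected to apply, so a crawler that honors wildcards may behave differently for those URLs.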

Comments

  • This Alternate robots.txt file was created by the Alternate Robots.txt WordPress plugin.
  • robots.txt for https://huckleberry.xenite.org/
  • Allow the good bots in
  • Block the well-behaved unwanted bots