pipe420.com
robots.txt

Robots Exclusion Standard data for pipe420.com

Resource Scan

Scan Details

Site Domain pipe420.com
Base Domain pipe420.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-10-23T06:42:40+00:00
Next Scan 2025-10-30T06:42:40+00:00

Last Successful Scan

Scanned 2025-09-21T23:17:53+00:00
URL https://pipe420.com/robots.txt
Domain IPs 104.21.83.197, 172.67.181.7, 2606:4700:3037::6815:53c5, 2606:4700:3037::ac43:b507
Response IP 172.67.181.7
Found Yes
Hash 6580e3e685a63c3555f352d3eaaef0ccdbd56b84bf366a9c8261e101fd949a1c
SimHash 4e74dd8266a1
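
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm, so the following is only a sketch under that assumption, showing how a locally saved copy of robots.txt could be compared against the recorded value. The function names `sha256_hex` and `matches_report` are hypothetical, and the scanner may normalize the body before hashing, in which case the digests would not match.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "6580e3e685a63c3555f352d3eaaef0ccdbd56b84bf366a9c8261e101fd949a1c"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a body against the reported hash.

    Assumption: the report's Hash field is SHA-256 over the raw response body.
    """
    return sha256_hex(body) == reported.lower()
```

A saved copy could be checked with `matches_report(open("robots.txt", "rb").read())`; a mismatch may mean either that the file has changed since the last successful scan or that the SHA-256 assumption does not hold.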

Groups

googlebot

Rule Path
Disallow (empty)

bingbot

Rule Path
Disallow (empty)

googlebot-image

Rule Path
Disallow (empty)

googlebot-mobile

Rule Path
Disallow (empty)

duckduckbot

Rule Path
Disallow (empty)

slurp

Rule Path
Disallow (empty)

facebookexternalhit

Rule Path
Disallow (empty)

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /cgi-bin/
Disallow /trackback/
Disallow /xmlrpc.php
Disallow /?s=
Disallow /?attachment_id=

Other Records

Field Value
crawl-delay 10
sitemap https://pipe420.com/sitemap_index.xml

Comments

  • Allow essential bots
  • Block aggressive or non-essential crawlers
  • Disallow sensitive WordPress directories for all other bots
  • Optional: Add crawl delay to reduce server load
  • Sitemap (replace with your actual sitemap URL)
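
The groups and records in this report can be turned back into a robots.txt and checked with Python's standard `urllib.robotparser`. This is a minimal sketch, not the site's actual file: the original layout is not shown in the report, the placement of `Crawl-delay` inside the `*` group is an assumption (the report lists it only under Other Records), and `build_robots_txt` and `ExampleCrawler` are hypothetical names used for illustration.

```python
from urllib.robotparser import RobotFileParser

# Groups taken from the report. An empty Disallow path blocks nothing.
ALLOWED = ["googlebot", "bingbot", "googlebot-image", "googlebot-mobile",
           "duckduckbot", "slurp", "facebookexternalhit"]
BLOCKED = ["mj12bot", "semrushbot", "petalbot", "ahrefsbot", "dotbot",
           "blexbot", "barkrowler", "ccbot", "baiduspider", "yandex"]
WILDCARD_PATHS = ["/wp-admin/", "/wp-includes/", "/cgi-bin/", "/trackback/",
                  "/xmlrpc.php", "/?s=", "/?attachment_id="]


def build_robots_txt() -> str:
    """Rebuild a robots.txt equivalent to the groups listed in the report."""
    lines = []
    for agent in ALLOWED:
        lines += [f"User-agent: {agent}", "Disallow:", ""]
    for agent in BLOCKED:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    lines.append("User-agent: *")
    lines += [f"Disallow: {path}" for path in WILDCARD_PATHS]
    # Assumption: the crawl delay belongs to the catch-all group.
    lines.append("Crawl-delay: 10")
    lines += ["", "Sitemap: https://pipe420.com/sitemap_index.xml"]
    return "\n".join(lines) + "\n"


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

BASE = "https://pipe420.com"
for agent, path in [("Googlebot", "/wp-admin/"),
                    ("AhrefsBot", "/"),
                    ("ExampleCrawler", "/wp-admin/"),
                    ("ExampleCrawler", "/blog/")]:
    print(agent, path, parser.can_fetch(agent, BASE + path))

print("crawl delay:", parser.crawl_delay("ExampleCrawler"))
print("sitemaps:", parser.site_maps())  # site_maps() requires Python 3.8+
```

A crawler follows only the group that matches it, so the bots with an empty Disallow are not bound by the `*` rules such as /wp-admin/. Those paths are restricted only for crawlers that have no group of their own.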