funnysentences.com
robots.txt

Robots Exclusion Standard data for funnysentences.com

Resource Scan

Scan Details

Site Domain funnysentences.com
Base Domain funnysentences.com
Scan Status Ok
Last Scan 2026-01-26T14:23:28+00:00
Next Scan 2026-02-02T14:23:28+00:00

Last Scan

Scanned 2026-01-26T14:23:28+00:00
URL https://funnysentences.com/robots.txt
Domain IPs 104.21.80.236, 172.67.187.166, 2606:4700:3032::ac43:bba6, 2606:4700:3036::6815:50ec
Response IP 172.67.187.166
Found Yes
Hash 1b64eb0a3becea32950e84d723eea3babdee6cc971987338f1b46bd7bff72871
SimHash 600ecd30e581
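
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest. The report does not say which algorithm the scanner uses or which bytes it hashes, so the following is only a minimal sketch of how a content fingerprint like this could be used to detect changes between scans. The function names `fingerprint` and `has_changed` are hypothetical.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw response bytes. The scanner's real
    algorithm and input normalization are not stated in the report.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored fingerprint against a freshly fetched body."""
    return fingerprint(body) != previous_hash
```

A scanner could store the digest after each scan and fetch the file again at the next scheduled scan, re-parsing the rules only when `has_changed` returns True.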

Groups

*

Rule Path
Allow /
Allow /category/
Allow /topic/
Allow /popular
Allow /privacy
Disallow /api/
Disallow /_next/
Disallow /admin/
Disallow /admin-dashboard
Disallow /test-*
Disallow /debug-*
Disallow /generator
Disallow *.json$

Other Records

Field Value
crawl-delay 1
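
The "*" group mixes a broad `Allow /` with narrower Disallow rules and wildcard patterns, so the outcome for a given path depends on how a crawler resolves overlaps. The sketch below assumes the longest-match convention described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie), with `*` matching any run of characters and a trailing `$` anchoring the end of the path. Individual crawlers may behave differently, and the helper names `pattern_to_regex` and `is_allowed` are hypothetical.

```python
import re

# Rules from the "*" group above, as (rule, path pattern) pairs.
STAR_GROUP_RULES = [
    ("allow", "/"),
    ("allow", "/category/"),
    ("allow", "/topic/"),
    ("allow", "/popular"),
    ("allow", "/privacy"),
    ("disallow", "/api/"),
    ("disallow", "/_next/"),
    ("disallow", "/admin/"),
    ("disallow", "/admin-dashboard"),
    ("disallow", "/test-*"),
    ("disallow", "/debug-*"),
    ("disallow", "/generator"),
    ("disallow", "*.json$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal pieces and let each "*" match any run of characters.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_allowed(path, rules=STAR_GROUP_RULES):
    """Apply longest-match precedence; a path with no match is allowed."""
    best_length, best_allow = -1, True
    for rule, pattern in rules:
        # re.match anchors at the start of the path, like a prefix rule.
        if not pattern_to_regex(pattern).match(path):
            continue
        allow = rule == "allow"
        longer = len(pattern) > best_length
        tie_for_allow = len(pattern) == best_length and allow
        if longer or tie_for_allow:
            best_length, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions `is_allowed("/api/sentences")` and `is_allowed("/data.json")` are False, while `is_allowed("/category/list.json")` is True, because `/category/` (10 characters) is longer than `*.json$` (7 characters). A parser that instead applies rules in listed order would stop at the leading `Allow /` and permit every path.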

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 0

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 0

slurp

Rule Path
Allow /

Other Records

Field Value
crawl-delay 0

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://funnysentences.com/sitemap.xml

Comments

  • Allow crawling of main content
  • Prevent crawling of unnecessary routes
  • Crawl delay to be respectful
  • Specific rules for major search engines
  • Block bad bots
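
The groups listed in this report can be written back out as a robots.txt file and queried per crawler. The text below is a reconstruction from the scan data, not the original file: the ordering, blank lines, and lowercase user-agent tokens are assumptions. The sketch uses Python's standard `urllib.robotparser` to show group selection, crawl delay, and the sitemap record; `ExampleBot` is a hypothetical crawler name that falls through to the "*" group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report; the real file's layout may differ.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Allow: /category/
Allow: /topic/
Allow: /popular
Allow: /privacy
Disallow: /api/
Disallow: /_next/
Disallow: /admin/
Disallow: /admin-dashboard
Disallow: /test-*
Disallow: /debug-*
Disallow: /generator
Disallow: *.json$
Crawl-delay: 1

User-agent: googlebot
Allow: /
Crawl-delay: 0

User-agent: bingbot
Allow: /
Crawl-delay: 0

User-agent: slurp
Allow: /
Crawl-delay: 0

User-agent: ahrefsbot
Disallow: /

User-agent: semrushbot
Disallow: /

User-agent: dotbot
Disallow: /

User-agent: mj12bot
Disallow: /

Sitemap: https://funnysentences.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

site = "https://funnysentences.com"
blocked_bot = parser.can_fetch("AhrefsBot", site + "/")            # own group: Disallow /
googlebot = parser.can_fetch("Googlebot", site + "/generator")     # own group: Allow /
googlebot_delay = parser.crawl_delay("Googlebot")                  # from the googlebot group
default_delay = parser.crawl_delay("ExampleBot")                   # falls back to "*"
sitemaps = parser.site_maps()                                      # requires Python 3.8+
```

A crawler that has its own group uses only that group, so the Disallow rules under "*" do not apply to googlebot, bingbot, or slurp here. The standard-library parser evaluates rules in listed order and does not expand `*` or `$` in paths, so it is used in this sketch only for group selection, crawl delay, and the sitemap, not for resolving the overlapping paths in the "*" group.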