learnbuddhism.org
robots.txt

Robots Exclusion Standard data for learnbuddhism.org

Resource Scan

Scan Details

Site Domain learnbuddhism.org
Base Domain learnbuddhism.org
Scan Status Ok
Last Scan2025-11-12T09:45:16+00:00
Next Scan 2025-12-12T09:45:16+00:00

Last Scan

Scanned2025-11-12T09:45:16+00:00
URL https://learnbuddhism.org/robots.txt
Domain IPs 104.21.57.126, 172.67.163.225, 2606:4700:3032::6815:397e, 2606:4700:3034::ac43:a3e1
Response IP 104.21.57.126
Found Yes
Hash 5de8a7bb3bd29df98ac3bec26fbe9505f071a4d0ff901a584652b8cda3965b36
SimHash 68029bd24ef1

Groups

*

Rule Path
Allow /
Disallow /*?m=
Disallow /dq/
Disallow /gr/
Disallow /co/
Disallow /gq/
Disallow /af/

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://learnbuddhism.org/sitemap.xml

Comments

  • Robots.txt for learnbuddhism.org
  • Block only confirmed spam patterns
  • Block tracking parameters
  • Disallow: /*?fbclid=
  • Disallow: /*?utm_
  • Block port access
  • Disallow: /*:2083
  • Disallow: /*:2096