paganlibrary.com
robots.txt

Robots Exclusion Standard data for paganlibrary.com

Resource Scan

Scan Details

Site Domain paganlibrary.com
Base Domain paganlibrary.com
Scan Status Ok
Last Scan2026-03-04T11:27:42+00:00
Next Scan 2026-03-11T11:27:42+00:00

Last Scan

Scanned2026-03-04T11:27:42+00:00
URL https://paganlibrary.com/robots.txt
Domain IPs 104.21.2.16, 172.67.128.147
Response IP 172.67.128.147
Found Yes
Hash 4ed76e422e60d7fe567cbf21ed545d95899d83cbcacfb6ba2732e011209d8d8f
SimHash 1d157964e7d0

Groups

*

Rule Path
Disallow /admin/
Disallow /cgi-bin/
Disallow /*.bak$
Disallow /bot-trap/

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Comments

  • robots.txt for http://www.paganlibrary.com
  • Directories
  • Disallow: /themes/
  • Files
  • Disallow: /CHANGELOG.txt
  • Paths (clean URLs)
  • Disallow: /admin/
  • Paths (no clean URLs)
  • Disallow: /?q=admin/
  • Disallow: *.xlsx$
  • Begin block Bad-Robots from robots.txt