Robots Exclusion Standard data for pac.org

Resource Scan

Scan Details

Site Domain pac.org
Base Domain pac.org
Scan Status Ok
Last Scan 2025-12-15T01:47:13+00:00
Next Scan 2026-01-14T01:47:13+00:00

Last Scan

Scanned 2025-12-15T01:47:13+00:00
URL https://pac.org/robots.txt
Domain IPs 172.66.41.14, 172.66.42.242, 2606:4700:3108::ac42:290e, 2606:4700:3108::ac42:2af2
Response IP 172.66.42.242
Found Yes
Hash d9abf3f782aeb994fa70f6a551c94750c41054317defda12b3bb15f0d53ef7f7
SimHash 380c724eceb1
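
The Hash value above is 64 hexadecimal digits long, which is the length of a SHA-256 digest. The scan record does not name the algorithm or say which bytes were hashed, so the following is only a sketch under the assumption that it is SHA-256 over the raw response body. The function names are hypothetical, and a mismatch may simply mean the assumption is wrong or the file has changed since the scan.

```python
import hashlib

# Hash recorded in the scan details above.
RECORDED_HASH = "d9abf3f782aeb994fa70f6a551c94750c41054317defda12b3bb15f0d53ef7f7"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_scan(body: bytes, recorded_hash: str = RECORDED_HASH) -> bool:
    """Compare a fetched robots.txt body against a recorded hash.

    Assumption: the recorded hash is SHA-256 over the raw body bytes.
    """
    return sha256_hex(body) == recorded_hash.lower()
```

To use it, pass the bytes of a freshly fetched https://pac.org/robots.txt to matches_scan; a False result suggests the file differs from the scanned copy, or that the hashing assumption does not hold.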

Groups

twitterbot

Rule Path
Disallow (empty path, so nothing is disallowed for this group)

*

Rule Path
Disallow /domains/
Disallow /members/
Disallow /private/
Disallow /files/private/
Disallow /mobile/
Disallow /out.php
Disallow /ete/
Disallow /conferences/pats12/ete
Disallow /*/ete
Disallow /webinar_recordings/
Disallow /p/
Disallow /wp-activate.php
Disallow /wp-app.php
Disallow /wp-blog-header.php
Disallow /wp-comments-post.php
Disallow /wp-config-sample.php
Disallow /wp-config.php
Disallow /wp-cron.php
Disallow /wp-links-opml.php
Disallow /wp-load.php
Disallow /wp-login.php
Disallow /wp-mail.php
Disallow /wp-settings.php
Disallow /wp-signup.php
Disallow /wp-trackback.php
Disallow /wp-admin/
Disallow /wp-includes/
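
The two groups above can be checked with Python's standard-library urllib.robotparser. This is a minimal sketch: the ROBOTS_TXT string is a reconstruction of a few of the listed rules, not the original file, and the allowed helper and the ExampleBot agent name are hypothetical. The wildcard rule /*/ete is left out on purpose, because the standard-library parser matches rules as plain path prefixes and would not treat the asterisk as a wildcard.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a partial reconstruction of the scanned file, built from the
# groups listed above. The real file's ordering and comments are not known.
ROBOTS_TXT = """\
User-agent: twitterbot
Disallow:

User-agent: *
Disallow: /members/
Disallow: /private/
Disallow: /wp-admin/
Disallow: /wp-login.php
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if the given user agent may fetch the given path."""
    return parser.can_fetch(agent, "https://pac.org" + path)


if __name__ == "__main__":
    for agent, path in [
        ("Twitterbot", "/members/"),
        ("ExampleBot", "/members/profile"),
        ("ExampleBot", "/wp-login.php"),
        ("ExampleBot", "/about/"),
    ]:
        print(agent, path, allowed(agent, path))
```

The empty Disallow in the twitterbot group leaves every path open to that agent, while any other agent falls through to the * group and is blocked from the listed prefixes.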

Comments

  • Directories
  • Files
  • Paths (clean URLs)