pcom.edu
robots.txt

Robots Exclusion Standard data for pcom.edu

Resource Scan

Scan Details

Site Domain pcom.edu
Base Domain pcom.edu
Scan Status Ok
Last Scan 2024-10-26T13:44:23+00:00
Next Scan 2024-11-25T13:44:23+00:00

Last Scan

Scanned 2024-10-26T13:44:23+00:00
URL https://pcom.edu/robots.txt
Redirect https://www.pcom.edu/robots.txt
Redirect Domain www.pcom.edu
Redirect Base pcom.edu
Domain IPs 205.174.26.144
Redirect IPs 205.174.26.144
Response IP 205.174.26.144
Found Yes
Hash ec66ec9c7b3de536e5ad0edd0da6d98f864e6bdb97b893ee737467936ac22143
SimHash 234d19e21799

Groups

*

Rule Path
Allow /
Allow /sitemap.xml
Disallow /_demo/
Disallow /_showcase/
Disallow /_test/
Disallow /zz-test/
Disallow /facstf/
Disallow *.inc$
Disallow *_props.html$
Disallow /*.xsl
Disallow /*.xml
Disallow /*.php
Disallow /404.html
Disallow *index-1.html
Disallow /*?utm
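
The rules above combine a blanket "Allow /" with narrower Disallow patterns that use the "*" wildcard and the "$" end anchor. The sketch below shows one way a crawler might evaluate a path against them. It assumes the longest-match-wins convention described in RFC 9309, with Allow winning ties. The names RULES, pattern_to_regex and is_allowed are hypothetical and not part of the scan data, and treating patterns that start with "*" (such as "*.inc$") as valid is also an assumption, since crawlers may differ on them.

```python
import re

# Rules copied from the "*" group in the scan; the order does not matter here
# because the most specific (longest) matching pattern decides.
RULES = [
    ("allow", "/"),
    ("allow", "/sitemap.xml"),
    ("disallow", "/_demo/"),
    ("disallow", "/_showcase/"),
    ("disallow", "/_test/"),
    ("disallow", "/zz-test/"),
    ("disallow", "/facstf/"),
    ("disallow", "*.inc$"),
    ("disallow", "*_props.html$"),
    ("disallow", "/*.xsl"),
    ("disallow", "/*.xml"),
    ("disallow", "/*.php"),
    ("disallow", "/404.html"),
    ("disallow", "*index-1.html"),
    ("disallow", "/*?utm"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern wins, and Allow wins a tie.
    A path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/sitemap.xml", "/feed.xml", "/_demo/page.html",
              "/news/?utm_source=x", "/includes/header.inc"]:
        print(p, is_allowed(p))
```

Under these assumptions "/sitemap.xml" stays crawlable even though "/*.xml" also matches it, because the explicit Allow pattern is longer than the Disallow pattern.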

Other Records

Field Value
sitemap https://www.pcom.edu/sitemap.xml

Comments

  • ANY CHANGES OR UPDATES TO robots.txt NEEDS TO BE PUBLISHED DIRECTLY TO PRODUCTION AND NOT STAGING.
  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449