semioffice.com
robots.txt

Robots Exclusion Standard data for semioffice.com

Resource Scan

Scan Details

Site Domain semioffice.com
Base Domain semioffice.com
Scan Status Ok
Last Scan2024-11-06T00:39:53+00:00
Next Scan 2024-11-13T00:39:53+00:00

Last Scan

Scanned2024-11-06T00:39:53+00:00
URL https://semioffice.com/robots.txt
Domain IPs 66.96.147.101
Response IP 66.96.147.101
Found Yes
Hash f54fdc975ab235aa1c78b3081c46cff862e8efa38955af77b89c7fb908b4ce74
SimHash 3c05d833c6d0

Groups

*

Rule Path
Disallow /private/
Disallow /admin/
Disallow /tmp/
Disallow /scripts/
Allow /
Disallow /*.pdf$

Other Records

Field Value
sitemap https://www.semioffice.com/sitemap.xml

Comments

  • Prevent indexing of specific file types
  • Sitemap location