sig.com
robots.txt
Robots Exclusion Standard data for sig.com
Resource Scan
Scan Details
Site Domain | sig.com |
Base Domain | sig.com |
Scan Status | Ok |
Last Scan | 2024-10-29T06:45:33+00:00 |
Next Scan | 2024-11-28T06:45:33+00:00 |
Last Scan
Scanned | 2024-10-29T06:45:33+00:00 |
URL | https://sig.com/robots.txt |
Domain IPs | 162.159.140.127, 172.66.0.125 |
Response IP | 172.66.0.125 |
Found | Yes |
Hash | a67a9a530b475d0edcd3f013cc45cdb43557c25aa85e91e2743b616e16ce3bb9 |
SimHash | 71451410cd90 |
Groups
*
Rule | Path |
---|---|
Disallow | /404 |
Disallow | /searchsite/ |
Disallow | /extra/ |
Disallow | /documents/ |
Other Records
Field | Value |
---|---|
sitemap | dayinthelife.sig.com/sitemap.xml |
sitemap | sig.com/sitemap.xml |