workalpha.com
robots.txt
Robots Exclusion Standard data for workalpha.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | workalpha.com |
Base Domain | workalpha.com |
Scan Status | Ok |
Last Scan | 2024-09-24T23:10:36+00:00 |
Next Scan | 2024-10-01T23:10:36+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-24T23:10:36+00:00 |
URL | https://workalpha.com/robots.txt |
Domain IPs | 104.21.55.198, 172.67.172.162, 2606:4700:3033::6815:37c6, 2606:4700:3037::ac43:aca2 |
Response IP | 104.21.55.198 |
Found | Yes |
Hash | 04fae2f33232f9e96f77cd93c90300583a2a38ec3cc248b7a310858874414ad3 |
SimHash | 79495445ce93 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /wp-content/plugins/ |
Disallow | /wp-content/themes/ |
Disallow | /feed/ |
Disallow | */feed/ |
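To see how a crawler might interpret the Disallow rules listed above, here is a minimal Python sketch. It assumes the common wildcard convention in which `*` matches any run of characters and a trailing `$` anchors the end of the path. The names `DISALLOW`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers for illustration, not part of the scan tool. Because the group contains no Allow rules, the sketch skips longest-match precedence and treats any matching Disallow as a block.

```python
import re

# Disallow paths copied from the wildcard user-agent group above.
DISALLOW = [
    "/cgi-bin/",
    "/wp-admin/",
    "/wp-includes/",
    "/wp-content/plugins/",
    "/wp-content/themes/",
    "/feed/",
    "*/feed/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, disallow=DISALLOW) -> bool:
    """Return False if any Disallow pattern matches from the start of `path`."""
    return not any(pattern_to_regex(p).match(path) for p in disallow)


if __name__ == "__main__":
    for candidate in ["/blog/post/", "/wp-admin/options.php", "/category/news/feed/"]:
        print(candidate, is_allowed(candidate))
```

Under these assumptions, the final rule `*/feed/` is what blocks feed URLs nested under other paths, while the plain `/feed/` rule only covers the top-level feed.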
Other Records
Field | Value |
---|---|
sitemap | http://workalpha.com/sitemap_index.xml |
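The group and record tables above can be turned back into robots.txt text. The Python sketch below shows one way to do that; `render_robots` is a hypothetical helper, and the output is only a reconstruction. The original file's exact bytes (comments, blank lines, field casing) are not known from this report, so the result should not be expected to reproduce the Hash value listed earlier.

```python
# Parsed data as shown in the Groups and Other Records tables.
GROUPS = {
    "*": [
        ("Disallow", "/cgi-bin/"),
        ("Disallow", "/wp-admin/"),
        ("Disallow", "/wp-includes/"),
        ("Disallow", "/wp-content/plugins/"),
        ("Disallow", "/wp-content/themes/"),
        ("Disallow", "/feed/"),
        ("Disallow", "*/feed/"),
    ],
}
OTHER_RECORDS = [("sitemap", "http://workalpha.com/sitemap_index.xml")]


def render_robots(groups, other_records) -> str:
    """Render parsed groups and records as robots.txt text (hypothetical helper)."""
    lines = []
    for user_agent, rules in groups.items():
        lines.append(f"User-agent: {user_agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line closes the group
    lines.extend(f"{field.capitalize()}: {value}" for field, value in other_records)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots(GROUPS, OTHER_RECORDS), end="")
```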