egress.com
robots.txt

Robots Exclusion Standard data for egress.com

Resource Scan

Scan Details

Site Domain egress.com
Base Domain egress.com
Scan Status Ok
Last Scan 2024-10-26T05:18:50+00:00
Next Scan 2024-11-25T05:18:50+00:00
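
The two timestamps above can be compared directly. The following is a minimal sketch that parses them and computes the gap between scans; the variable names are illustrative and not part of the report, and the report itself does not state how the next scan time is chosen.

```python
from datetime import datetime

# Timestamps copied from the "Scan Details" fields above.
last_scan = datetime.fromisoformat("2024-10-26T05:18:50+00:00")
next_scan = datetime.fromisoformat("2024-11-25T05:18:50+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)
```

For these two values the gap works out to a whole number of days, which suggests (but does not prove) a fixed rescan interval.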

Last Scan

Scanned 2024-10-26T05:18:50+00:00
URL https://egress.com/robots.txt
Redirect https://www.egress.com/robots.txt
Redirect Domain www.egress.com
Redirect Base egress.com
Domain IPs 13.107.246.59
Redirect IPs 13.107.246.59, 2620:1ec:bdf::59
Response IP 13.107.246.57
Found Yes
Hash 1b115c1f76a8534cb7561bb1e5d9183a758ab9d3e15cf8af10bd9e5cbaa59d9d
SimHash 4b001a7f5fb2

Groups

*

Rule Path
Allow /DependencyHandler.axd
Disallow /aspnet_client/
Disallow /bin/
Disallow /config/
Disallow /umbraco/
Disallow /umbraco_client/
Disallow /usercontrols/
Disallow /*.axd
Disallow *?lang*
Disallow *error.html*
Disallow *search?*
Disallow *?author*
Disallow *?mkt_tok*
Disallow /egress-in-the-news/
Disallow /en-us/egress-in-the-news/
Disallow /media/
Disallow /*.pdf
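
To illustrate how the rules above might be evaluated for a given path, here is a rough sketch. It assumes the common interpretation in which `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and individual crawlers may treat patterns that do not begin with `/` differently.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "/DependencyHandler.axd"),
    ("disallow", "/aspnet_client/"),
    ("disallow", "/bin/"),
    ("disallow", "/config/"),
    ("disallow", "/umbraco/"),
    ("disallow", "/umbraco_client/"),
    ("disallow", "/usercontrols/"),
    ("disallow", "/*.axd"),
    ("disallow", "*?lang*"),
    ("disallow", "*error.html*"),
    ("disallow", "*search?*"),
    ("disallow", "*?author*"),
    ("disallow", "*?mkt_tok*"),
    ("disallow", "/egress-in-the-news/"),
    ("disallow", "/en-us/egress-in-the-news/"),
    ("disallow", "/media/"),
    ("disallow", "/*.pdf"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow) of the strongest match so far
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as prefix rules require.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Tuple comparison: longer pattern wins; Allow wins a tie.
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("/DependencyHandler.axd")` is true because the Allow pattern is longer than `/*.axd`, while `is_allowed("/whitepaper.pdf")` and `is_allowed("/products?lang=de")` are false.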

Other Records

Field Value
sitemap https://www.egress.com/xmlsitemap
sitemap https://www.egress.com/en-us/xmlsitemap
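
The sitemap records above can also be read programmatically. The sketch below uses Python's standard `urllib.robotparser` (its `site_maps()` method requires Python 3.8 or later). The raw robots.txt file is not shown in this report, so `ROBOTS_LINES` is a partial reconstruction for illustration only, and `list_sitemaps` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the file from the records above (an assumption;
# the original file's exact text and ordering are not shown in the report).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /media/",
    "Sitemap: https://www.egress.com/xmlsitemap",
    "Sitemap: https://www.egress.com/en-us/xmlsitemap",
]


def list_sitemaps(lines):
    """Return the Sitemap URLs declared in the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap records are present.
    return parser.site_maps() or []


print(list_sitemaps(ROBOTS_LINES))
```

Note that `urllib.robotparser` does not interpret `*` wildcards in paths, so it is used here only for the sitemap records, not for evaluating the Disallow rules.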