carriere.com
robots.txt

Robots Exclusion Standard data for carriere.com

Resource Scan

Scan Details

Site Domain carriere.com
Base Domain carriere.com
Scan Status Ok
Last Scan2024-11-14T20:41:06+00:00
Next Scan 2024-11-28T20:41:06+00:00

Last Scan

Scanned2024-11-14T20:41:06+00:00
URL https://carriere.com/robots.txt
Redirect https://www.carriere.com/robots.txt
Redirect Domain www.carriere.com
Redirect Base carriere.com
Domain IPs 62.165.72.236
Redirect IPs 62.165.72.236
Response IP 62.165.72.236
Found Yes
Hash d7160370fdf1a715c7f77f8c8c647dc2cc9395d929cbd40fdc790e6dc5a04828
SimHash 61082863bf10

Groups

*

Rule Path
Disallow /aspnet_client/
Disallow /bin/
Disallow /config/
Disallow /data/
Disallow /install/
Disallow /macroScripts/
Disallow /masterpages/
Disallow /umbraco/
Disallow /umbraco_client/
Disallow /usercontrols/
Disallow /xslt/
Disallow /*?*
Allow /*?from=
Allow /*?page=
Allow /*?v=
Allow /*?currentpage=
Allow /media/*

Other Records

Field Value
sitemap https://www.carriere.com/sitemap.xml