cleanlink.com
robots.txt

Robots Exclusion Standard data for cleanlink.com

Resource Scan

Scan Details

Site Domain cleanlink.com
Base Domain cleanlink.com
Scan Status Ok
Last Scan2024-05-04T13:50:44+00:00
Next Scan 2024-06-03T13:50:44+00:00

Last Scan

Scanned2024-05-04T13:50:44+00:00
URL https://cleanlink.com/robots.txt
Domain IPs 40.86.89.230
Response IP 40.86.89.230
Found Yes
Hash 64ea0c53961561ae669336900d2eeb0936f7ea52043b43681bc03f741d3fe8bf
SimHash 0b31000b4fe6

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /_mm/
Disallow /_notes/
Disallow /_baks/
Disallow /MMWIP/
Allow /resources/editorial/
Allow /resources/education/
Allow /resources/makreting/
Disallow /resources/
Disallow /emails/
Disallow /virtual/
Disallow /ProductAwards/
Disallow /go/
Disallow /App_Code/
Disallow /App_Components/
Disallow /test/
Disallow /sample/
Disallow *.csi

Comments

  • robots.txt for http://www.cleanlink.com/