cylex.ie
robots.txt

Robots Exclusion Standard data for cylex.ie

Resource Scan

Scan Details

Site Domain cylex.ie
Base Domain cylex.ie
Scan Status Ok
Last Scan2024-05-21T08:40:11+00:00
Next Scan 2024-06-20T08:40:11+00:00

Last Scan

Scanned2024-05-21T08:40:11+00:00
URL https://cylex.ie/robots.txt
Redirect https://www.cylex.ie/robots.txt
Redirect Domain www.cylex.ie
Redirect Base cylex.ie
Domain IPs 104.18.6.122, 104.18.7.122, 2606:4700::6812:67a, 2606:4700::6812:77a
Redirect IPs 104.18.6.122, 104.18.7.122, 2606:4700::6812:67a, 2606:4700::6812:77a
Response IP 104.18.6.122
Found Yes
Hash 9cf923e90f4dfe82c23c09a5db732396e2691a4a87b7d9921f8fc0a29174921e
SimHash 2864c566f243

Groups

*

Rule Path
Disallow /google/
Disallow /reviews/
Disallow /webmastertools/
Disallow /fir_news/
Disallow /webservices/
Disallow /Homepage/internet-shops.asp
Disallow /webshop/
Disallow /correct/
Disallow /webservices/
Disallow /ContactForm.aspx
Disallow /ContactForm2.aspx
Disallow /ContactForm.ashx
Disallow /uploadHandler.ashx
Disallow /userContent.ashx
Disallow /korrektur_firmendaten.asp
Disallow /s?
Disallow /ScriptResource.axd
Disallow /combinescriptshandler.axd
Disallow /WebResource.axd
Disallow /info/cookies_policy.html
Disallow /info/terms-and-conditions-customer-reviews.html
Disallow /info/terms-and-conditions-quote-requests.html
Disallow /api/
Disallow /showCompanyMap
Disallow /cdn-cgi/

mediapartners-google

Rule Path
Allow /s?

gptbot

Rule Path
Disallow /