italianways.com
robots.txt

Robots Exclusion Standard data for italianways.com

Resource Scan

Scan Details

Site Domain italianways.com
Base Domain italianways.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2024-05-18T07:59:01+00:00
Next Scan 2024-08-16T07:59:01+00:00

Last Successful Scan

Scanned2023-06-29T19:27:04+00:00
URL https://www.italianways.com/robots.txt
Domain IPs 104.21.13.165, 172.67.156.204, 2606:4700:3030::ac43:9ccc, 2606:4700:3033::6815:da5
Response IP 172.67.156.204
Found Yes
Hash 67161fbba21ad5d4286a8a532d515882cf2dac25e91c194b6fb2f3b2ded62c3c
SimHash 31489a588333

Groups

*

Rule Path
Allow wday/asset/ui-html/**/workdayApp.min.js
Allow wday/asset/uic-shared-vendors/**/shared-vendors.min.js
Allow wday/asset/ui-html/**/shared-min.js
Allow wday/asset/candidate-experience-*/**/*.js

Comments

  • Disables (compliant) crawler indexing

Warnings

  • `noindex` is not a known field.