carriera.aldi.it
robots.txt

Robots Exclusion Standard data for carriera.aldi.it

Resource Scan

Scan Details

Site Domain carriera.aldi.it
Base Domain aldi.it
Scan Status Ok
Last Scan2025-06-01T21:03:47+00:00
Next Scan 2025-07-01T21:03:47+00:00

Last Scan

Scanned2025-06-01T21:03:47+00:00
URL https://carriera.aldi.it/robots.txt
Domain IPs 80.243.175.23
Response IP 80.243.175.23
Found Yes
Hash ef448c3e61acf146c38c9523e85fe6433dc6db49f5f639a401d40cb7234eedf1
SimHash 994978d6af82

Groups

*

Rule Path
Disallow /*/Private/*
Disallow /*/Configuration/*
Disallow /fileadmin/template/
Disallow /fileadmin/templates/
Disallow /typo3/
Disallow /typo3_src/
Disallow /vendor/
Disallow /typo3temp/
Disallow /typo3conf/
Disallow /stats/
Disallow /error/
Disallow /error_logs/
Allow /typo3/sysext/frontend/Resources/Public/*
Allow /typo3conf/ext/
Allow /typo3temp/*.css
Allow /typo3temp/*.css.*.gzip
Allow /typo3temp/*.js
Allow /typo3temp/*.js.*.gzip
Allow /typo3temp/*.jpg
Allow /typo3temp/*.gif
Allow /typo3temp/*.png

Other Records

Field Value
sitemap https://carriera.aldi.it/sitemap.xml

Comments

  • Should always be protected (.htaccess)