tetrapak.com
robots.txt

Robots Exclusion Standard data for tetrapak.com

Resource Scan

Scan Details

Site Domain tetrapak.com
Base Domain tetrapak.com
Scan Status Ok
Last Scan2024-11-02T01:09:56+00:00
Next Scan 2024-12-02T01:09:56+00:00

Last Scan

Scanned2024-11-02T01:09:56+00:00
URL https://tetrapak.com/robots.txt
Redirect https://www.tetrapak.com/robots.txt
Redirect Domain www.tetrapak.com
Redirect Base tetrapak.com
Domain IPs 13.248.160.137
Redirect IPs 138.113.125.53, 138.113.21.174, 138.113.21.246, 163.171.208.133, 163.171.209.213, 203.117.159.15
Response IP 138.113.125.53
Found Yes
Hash 15af881fd7c079b4c067d7593308132dc9633b3db52db84efa13a94fa6ef3791
SimHash 4048dc57c792

Groups

onetrustbot

Rule Path
Allow /

*

Rule Path
Disallow /*/error/*

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.tetrapak.com/sitemap.xml

Comments

  • Disallow paths for Googlebot crawler
  • User-agent: Googlebot
  • Disallow: /nogooglebot/
  • Allow OneTrustBot for All crawler
  • Disallow error path for All crawler
  • Sitemap Index Path