tetrapak.com
robots.txt

Robots Exclusion Standard data for tetrapak.com

Resource Scan

Scan Details

Site Domain tetrapak.com
Base Domain tetrapak.com
Scan Status Ok
Last Scan2024-05-12T12:47:20+00:00
Next Scan 2024-06-11T12:47:20+00:00

Last Scan

Scanned2024-05-12T12:47:20+00:00
URL https://tetrapak.com/robots.txt
Redirect https://www.tetrapak.com/robots.txt
Redirect Domain www.tetrapak.com
Redirect Base tetrapak.com
Domain IPs 13.248.160.137
Redirect IPs 132.147.114.72, 163.171.211.114
Response IP 132.147.114.72
Found Yes
Hash 21ec0e787e06ee947a3cce8a0e2f1006998137a0edc314342aaf2e577db42b15
SimHash 4049ce57c592

Groups

onetrustbot

Rule Path
Allow /

*

Rule Path
Disallow /*/error/*

Other Records

Field Value
sitemap https://www.tetrapak.com/sitemap.xml

Comments

  • Disallow paths for Googlebot crawler
  • User-agent: Googlebot
  • Disallow: /nogooglebot/
  • Allow OneTrustBot for All crawler
  • Disallow error path for All crawler
  • Sitemap Index Path