twgtea.com
robots.txt

Robots Exclusion Standard data for twgtea.com

Resource Scan

Scan Details

Site Domain twgtea.com
Base Domain twgtea.com
Scan Status Ok
Last Scan2024-10-28T03:09:08+00:00
Next Scan 2024-11-27T03:09:08+00:00

Last Scan

Scanned2024-10-28T03:09:08+00:00
URL https://twgtea.com/robots.txt
Domain IPs 13.33.88.106, 13.33.88.44, 13.33.88.83, 13.33.88.91, 2600:9000:223b:1600:b:2ab8:bdc0:93a1, 2600:9000:223b:2a00:b:2ab8:bdc0:93a1, 2600:9000:223b:2c00:b:2ab8:bdc0:93a1, 2600:9000:223b:4400:b:2ab8:bdc0:93a1, 2600:9000:223b:5200:b:2ab8:bdc0:93a1, 2600:9000:223b:b600:b:2ab8:bdc0:93a1, 2600:9000:223b:f600:b:2ab8:bdc0:93a1, 2600:9000:223b:fe00:b:2ab8:bdc0:93a1
Response IP 13.33.88.91
Found Yes
Hash 3e067474b5ea89c57e00ad488cd53712a24c50c4bfc5272afbb57dd242b26791
SimHash cceec202afb5

Groups

*

Rule Path
Allow /
Disallow /*.pdf

Other Records

Field Value
sitemap https://twgtea.com/sitemap.xml

Warnings

  • 11 invalid lines.