globecastwebtv.com
robots.txt

Robots Exclusion Standard data for globecastwebtv.com

Resource Scan

Scan Details

Site Domain globecastwebtv.com
Base Domain globecastwebtv.com
Scan Status Ok
Last Scan2024-11-14T14:58:07+00:00
Next Scan 2024-11-21T14:58:07+00:00

Last Scan

Scanned2024-11-14T14:58:07+00:00
URL https://globecastwebtv.com/robots.txt
Domain IPs 213.186.33.4
Response IP 213.186.33.4
Found Yes
Hash 710ed69b20a234279e74a7680c6e4447f6de140b4a64f3e280e1e0634cb21fe1
SimHash 04449f504870

Groups

google-adstxt

Rule Path
Disallow /
Allow /ads.txt
Allow /app-ads.txt

mediapartners-google

Rule Path
Disallow /
Allow /ads.txt
Allow /app-ads.txt

googlebot

Rule Path
Disallow /
Allow /ads.txt
Allow /app-ads.txt

*

Rule Path
Disallow /

Comments

  • robots.txt for globecastwebtv.com
  • last updated: 2024-08-26
  • syntax
  • User-agent: * (any) or <crawler-useragent>
  • Disallow: /<path> (deny for that path) or / (deny all) or <empty> (allow all)
  • Allow: /<path>
  • Ad serving: let ad platforms finds ad serving files
  • deny robots