vremeaonline.com
robots.txt

Robots Exclusion Standard data for vremeaonline.com

Resource Scan

Scan Details

Site Domain vremeaonline.com
Base Domain vremeaonline.com
Scan Status Ok
Last Scan2025-10-28T00:38:37+00:00
Next Scan 2025-10-29T00:38:37+00:00

Last Scan

Scanned2025-10-28T00:38:37+00:00
URL https://www.vremeaonline.com/robots.txt
Domain IPs 104.21.12.141, 172.67.194.231, 2606:4700:3033::ac43:c2e7, 2606:4700:3034::6815:c8d
Response IP 172.67.194.231
Found Yes
Hash c67ff3f839c43ea6ad2cdd269a833e36974549571c0a366b562c83cd6d8ad6d9
SimHash 6d04c6524793

Groups

googlebot-image

Rule Path
Disallow /

mediapartners-google

Rule Path
Allow /

*

Rule Path
Disallow /wap/
Disallow /ton/
Disallow /wml2html/
Disallow /wb/
Disallow /werbung/
Disallow /reports/philip-eden/
Allow /

Other Records

Field Value
sitemap https://www.weatheronline.de/sitemap_de.xml
sitemap https://www.weatheronline.de/sitemap_de_trend.xml
sitemap https://www.weatheronline.de/sitemap_all.xml

Comments

  • Anweisungen für Bing:
  • User-agent: bingbot
  • Disallow: /ton/
  • Disallow: /css/
  • Disallow: /karten/
  • Crawl-delay: 8