tech.utzer.de
robots.txt

Robots Exclusion Standard data for tech.utzer.de

Resource Scan

Scan Details

Site Domain tech.utzer.de
Base Domain utzer.de
Scan Status Ok
Last Scan2024-10-04T04:00:40+00:00
Next Scan 2024-10-05T04:00:40+00:00

Last Scan

Scanned2024-10-04T04:00:40+00:00
URL https://tech.utzer.de/robots.txt
Domain IPs 2a01:4f9:4a:3793:0:10:0:d3de, 95.217.122.172
Response IP 95.217.122.172
Found Yes
Hash 244011ab131c213ef195aa4cc5c6df88f02b686c25df5d80776d6ce73a91a401
SimHash 41558853e675

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /trackback
Disallow /comments
Disallow /category/*/*
Disallow */trackback
Disallow */comments
Disallow /*?*
Disallow /*?
Allow /wp-content/uploads
Allow /?page_id=
Allow /?p=*

mediapartners-google*

Rule Path
Disallow
Allow /*

ia_archiver

Rule Path
Allow /*

duggmirror

Rule Path
Disallow /

Other Records

Field Value
sitemap http://blog.utzer.de/sitemap.xml.gz

Comments

  • BEGIN XML-SITEMAP-PLUGIN
  • END XML-SITEMAP-PLUGIN
  • Google Image
  • User-agent: Googlebot-Image
  • Disallow: /*
  • Google AdSense
  • Internet Archiver Wayback Machine
  • digg mirror

Warnings

  • 1 invalid line.