tuxtweaks.com
robots.txt

Robots Exclusion Standard data for tuxtweaks.com

Resource Scan

Scan Details

Site Domain tuxtweaks.com
Base Domain tuxtweaks.com
Scan Status Ok
Last Scan 2025-09-06T12:11:27+00:00
Next Scan 2025-09-13T12:11:27+00:00

Last Scan

Scanned 2025-09-06T12:11:27+00:00
URL https://tuxtweaks.com/robots.txt
Domain IPs 104.21.2.108, 172.67.129.25, 2606:4700:3032::ac43:8119, 2606:4700:3037::6815:26c
Response IP 104.21.2.108
Found Yes
Hash 0c5d7eccf2b67352004a8d07533e5bfd3886fca92763dc50ea563ba0d5041eee
SimHash 41158ad3a265

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /trackback
Disallow /feed
Disallow /comments
Disallow /category/*/*
Disallow */trackback
Disallow */feed
Disallow */comments
Allow /wp-content/uploads

Other Records

Field Value
crawl-delay 10
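
To make the effect of the rules in the * group concrete, the following is a minimal sketch of how a crawler might evaluate them. It assumes the longest-match convention described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie) and treats * as "any run of characters". The names RULES, pattern_to_regex and is_allowed are hypothetical and are not part of the scan data. The crawl-delay record is not modelled here; it is a non-standard field that only some crawlers honor.

```python
import re

# Rules copied from the "*" group listed above.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-admin"),
    ("disallow", "/wp-includes"),
    ("disallow", "/wp-content/plugins"),
    ("disallow", "/wp-content/cache"),
    ("disallow", "/wp-content/themes"),
    ("disallow", "/trackback"),
    ("disallow", "/feed"),
    ("disallow", "/comments"),
    ("disallow", "/category/*/*"),
    ("disallow", "*/trackback"),
    ("disallow", "*/feed"),
    ("disallow", "*/comments"),
    ("allow", "/wp-content/uploads"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    The regex is applied with re.match, so it is anchored at the start.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumed convention: the longest matching pattern wins, Allow wins
    a tie, and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for sample in ("/wp-admin/options.php", "/some-post/feed",
                   "/category/linux", "/wp-content/uploads/pic.png"):
        print(sample, is_allowed(sample))
```

Under these assumptions, /category/linux stays crawlable because /category/*/* needs a second path segment, while any URL ending in /feed, /trackback or /comments is blocked by the */... patterns. Real crawlers may differ in how they handle wildcards and ties.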

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

ia_archiver

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /
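
The named groups above apply only to the crawler they name; every other crawler falls back to the * group. The sketch below illustrates that selection step. It assumes a case-insensitive exact match on the crawler's product token, and it assumes that the trailing * in "mediapartners-google*" can simply be stripped before comparing, which the standard does not define. GROUPS and select_group are hypothetical names used only for illustration.

```python
# Named groups as listed in the scan; each maps to (rule, path) pairs.
# An empty Disallow path blocks nothing.
GROUPS = {
    "googlebot-image": [("disallow", ""), ("allow", "/*")],
    "mediapartners-google*": [("disallow", ""), ("allow", "/*")],
    "ia_archiver": [("disallow", "/")],
    "duggmirror": [("disallow", "/")],
}


def select_group(product_token, groups=GROUPS):
    """Return the name of the group that applies to `product_token`.

    Assumptions: matching is case-insensitive and exact, a trailing '*'
    in a group name is ignored, and '*' is the fallback group.
    """
    token = product_token.lower()
    for name in groups:
        if token == name.rstrip("*").lower():
            return name
    return "*"


if __name__ == "__main__":
    for agent in ("Googlebot-Image", "Mediapartners-Google",
                  "ia_archiver", "bingbot"):
        print(agent, "->", select_group(agent))
```

With this selection, googlebot-image and mediapartners-google are given full access (empty Disallow plus Allow /*), ia_archiver and duggmirror are blocked from the whole site, and any other crawler is governed by the * group.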

Other Records

Field Value
sitemap https://tuxtweaks.com/sitemap.xml.gz

Comments

  • Disallow: /*?*
  • Disallow: /*?
  • Google Image
  • Google AdSense
  • Internet Archiver Wayback Machine
  • digg mirror