pixelwit.com
robots.txt

Robots Exclusion Standard data for pixelwit.com

Resource Scan

Scan Details

Site Domain pixelwit.com
Base Domain pixelwit.com
Scan Status Ok
Last Scan 2025-10-26T14:12:20+00:00
Next Scan 2025-11-25T14:12:20+00:00

Last Scan

Scanned 2025-10-26T14:12:20+00:00
URL https://pixelwit.com/robots.txt
Domain IPs 192.254.186.176
Response IP 192.254.186.176
Found Yes
Hash bf089282d2ccd8c397ead3f13b8b3b91105a8da90b6c73051499b5fdf505a5c1
SimHash 2c4eb703ca62

Groups

*

Rule Path
Disallow /pages/imagepagesx/
Disallow /Scripts/
Disallow /temp/
Disallow /pics/
Disallow /blog/image-galleries/hiking/
Disallow /paid/
Disallow /tile/
Disallow /cgi-bin/
Disallow /manual
Disallow /manual/*
Disallow /phpmanual/
Disallow /blog/contact/
Disallow /blog/wp-admin/
Disallow /blog/wp-includes/
Disallow /blog/category/
Disallow /blog/page-flip/improved-pageflip-paid/
Disallow /blog/page-flip/improved-pageflip-file/
Disallow /blog/page-flip/file-confirmation/
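
The `*` group above is made up mostly of plain path-prefix rules. As a rough sketch, Python's standard `urllib.robotparser` can check paths against a subset of them. The rules are copied from the scan; the `ExampleBot` agent name, the `allowed` helper and the sample paths are hypothetical. The `/manual/*` entry is left out because this parser does simple prefix matching and does not interpret wildcards.

```python
from urllib.robotparser import RobotFileParser

# Subset of the "*" group from the scan (prefix rules only).
STAR_GROUP = """\
User-agent: *
Disallow: /Scripts/
Disallow: /temp/
Disallow: /cgi-bin/
Disallow: /manual
Disallow: /blog/wp-admin/
Disallow: /blog/category/
"""

parser = RobotFileParser()
parser.parse(STAR_GROUP.splitlines())


def allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the hypothetical agent may fetch the path."""
    # "ExampleBot" has no group of its own, so the "*" group applies.
    return parser.can_fetch(agent, "https://pixelwit.com" + path)


if __name__ == "__main__":
    for sample in ("/temp/x.html", "/blog/page-flip/", "/manual"):
        print(sample, allowed(sample))
```

Prefix matching is case-sensitive, so under this sketch `/Scripts/` is blocked while a hypothetical `/scripts/` path would not be.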

googlebot

Rule Path
Disallow /pages/imagepagesx/
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.gz$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /*?*
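
The googlebot rules rely on two pattern characters: `*` matches any run of characters, and a trailing `$` anchors the pattern to the end of the URL. The sketch below translates each pattern into a regular expression to show which paths the group would block. It is an illustration under those assumptions, not Google's actual matcher; the helper names and sample paths are hypothetical. Because the group contains no Allow rules, any single match is enough to disallow a path.

```python
import re

# Disallow patterns copied from the googlebot group in the scan.
GOOGLEBOT_DISALLOW = [
    "/pages/imagepagesx/", "/*.php$", "/*.js$", "/*.inc$", "/*.css$",
    "/*.gz$", "/*.wmv$", "/*.cgi$", "/*.xhtml$", "/*?*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the URL.
    Without '$' the pattern only has to match a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any googlebot Disallow pattern matches the path."""
    # re.match anchors at the start of the string, as prefix rules require.
    return any(pattern_to_regex(p).match(path) for p in GOOGLEBOT_DISALLOW)


if __name__ == "__main__":
    for sample in ("/blog/index.php", "/blog/?p=12", "/blog/page-flip/"):
        print(sample, is_disallowed(sample))
```

Because of the `$` anchor, a path such as `/style.css` matches `/*.css$`, while the hypothetical `/style.css.map` does not.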

duggmirror

Rule Path
Disallow /

mediapartners-google*

Rule Path
Disallow
Allow /*

Comments

  • disallow all files in these directories
  • disallow all files ending with these extensions
  • disallow all files with ? in url
  • disable duggmirror
  • allow adsense bot on entire site