emailsanta.com
robots.txt

Robots Exclusion Standard data for emailsanta.com

Resource Scan

Scan Details

Site Domain emailsanta.com
Base Domain emailsanta.com
Scan Status Ok
Last Scan 2025-10-10T10:22:27+00:00
Next Scan 2025-10-17T10:22:27+00:00

Last Scan

Scanned 2025-10-10T10:22:27+00:00
URL https://emailsanta.com/robots.txt
Redirect https://www.emailsanta.com/robots.txt
Redirect Domain www.emailsanta.com
Redirect Base emailsanta.com
Domain IPs 104.21.45.53, 172.67.210.76, 2606:4700:3034::6815:2d35, 2606:4700:3034::ac43:d24c
Redirect IPs 104.21.45.53, 172.67.210.76, 2606:4700:3034::6815:2d35, 2606:4700:3034::ac43:d24c
Response IP 104.21.45.53
Found Yes
Hash 66f7b1bf38a53498db1fe07cb8a03d02146483f8ca55a114e0d7df8ebb8c6932
SimHash a34ce9558ca7
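
The Hash value is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not say which algorithm the scanner uses. A minimal sketch, assuming SHA-256 over the raw response bytes, of how a change between two scans could be detected. The function names and the sample body are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return a hex digest of a robots.txt body (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest against the digest of a freshly fetched body."""
    return body_digest(body) != previous_digest.lower()


# Hypothetical sample body, not the real file contents.
sample = b"User-agent: *\nDisallow: /cgi-bin/\n"
digest = body_digest(sample)
```

An exact digest changes on any byte difference, including whitespace, so it only answers whether the file is identical to the previous scan.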

Groups

*

Product Comment
* directed to all spiders
Rule Path Comment
Disallow */_fpclass/* Directory- FP created directory
Disallow */_private/* Directory- FP created directory
Disallow */_themes/* Directory- FP created directory
Disallow */audio/* Directory- proprietary sounds
Disallow */cdn-cgi/* CloudFlare crawl error
Disallow */cgi-bin/* Directory- cgi-bin
Disallow */dbase/* Directory- response programming
Disallow */dnx/* Directory- FP created directory
Disallow */test/* Directory- site test files
Disallow */frlet_pere* reply from Pere Noel
Disallow */frnav* navbars: French
Disallow */let_pet* reply from Rudolph
Disallow */bearspaw.asp bearspaw school page for letters to Santa
Disallow */search.asp -
Disallow */Search.asp -
Disallow */XmasEve_Tracker-iFrame* -
Disallow */santa-claus-xmas-blog/robots.txt -
Disallow */santa-claus-xmas-blog/sitemap.xml -
Disallow */santa-claus-xmas-blog/sitemap.xml.gz -
Disallow */fbi.asp -
Disallow */swf/* SWF player
Disallow */z_c* -
Disallow */z_D* -
Disallow */z_R* -
Disallow */ze* -
Disallow */zG* -
Allow /zTemplate-Loon.asp -
Disallow */zu* -
Disallow */xml* -
Allow */zh-* -
Disallow *santa-player.swf -
Disallow *santa-claus-blog.emailsanta.com/tag/* -
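
Most paths in this group begin with `*` and depend on wildcard matching. A minimal sketch of how such rules might be evaluated, assuming Google-style semantics: `*` matches any run of characters, a trailing `$` anchors the end of the path, matching is case-sensitive, the longest matching pattern wins, and Allow wins a tie. Other crawlers may interpret these patterns differently. The helper names are hypothetical, and the rule list is only a subset copied from the group above.

```python
import re


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regular expression."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text and turn each '*' into '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path: str, rules: list) -> bool:
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if rule_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


# Subset of the '*' group listed above.
RULES = [
    ("disallow", "*/cgi-bin/*"),
    ("disallow", "*/search.asp"),
    ("disallow", "*/Search.asp"),
    ("allow", "/zTemplate-Loon.asp"),
    ("disallow", "*/zu*"),
    ("allow", "*/zh-*"),
]
```

Under these assumptions `is_allowed("/Search.asp", RULES)` is False while `is_allowed("/SEARCH.ASP", RULES)` is True, which suggests why both `search.asp` and `Search.asp` are listed separately.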

googlebot

Rule Path Comment
Disallow */christmas-cards/holiday-cards-send* Hack so Twitterbot can crawl these pages (needs meta tags to create pic) but Google doesn't

scoutjet

Rule Path
Disallow /
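
The three groups above (`*`, `googlebot`, `scoutjet`) apply to different crawlers. A minimal sketch of group selection, assuming a crawler picks the most specific group whose name appears in its user-agent string (compared case-insensitively) and falls back to `*` otherwise. The function name and the user-agent strings in the usage note are hypothetical, and real crawlers may match product tokens more strictly.

```python
def select_group(user_agent: str, groups: dict) -> str:
    """Return the name of the group that would apply to this user agent."""
    token = user_agent.lower()
    matches = [name for name in groups if name != "*" and name.lower() in token]
    if matches:
        # Prefer the longest (most specific) matching group name.
        return max(matches, key=len)
    return "*"


# Group names and a few rules taken from the listing above.
GROUPS = {
    "*": [("disallow", "*/cgi-bin/*"), ("disallow", "*/search.asp")],
    "googlebot": [("disallow", "*/christmas-cards/holiday-cards-send*")],
    "scoutjet": [("disallow", "/")],
}
```

With this selection, `select_group("Googlebot/2.1", GROUPS)` returns `"googlebot"` and `select_group("Twitterbot/1.0", GROUPS)` returns `"*"`. Under that interpretation a crawler matching the `googlebot` group would follow only that group's single rule, not the rules in the `*` group.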

Other Records

Field Value
sitemap https://www.emailSanta.com/sitemap.xml
sitemap https://www.emailSanta.com/sitemap-international.xml
sitemap https://santa-claus-blog.emailsanta.com/post-sitemap.xml

Comments

  • User-agent: Mediapartners-Google
  • Disallow: # as per https://support.google.com/adsense/answer/10532
  • Disallow: */js/* # Directory- Liquid Motion created directory
  • Disallow: */images/* # Directory- proprietary images (re-enabled for Twitter cards)
  • Disallow: */letsanta* # reply from Santa. Commented out as Adsense needs to be able to see it
  • Disallow: */santa-claus-xmas-blog/* # Old WordPress site
  • Disallow: */wp-admin/* # WordPress backend materials
  • Disallow: */wp-includes/* # WordPress backend materials
  • Disallow: */wp-content/plugins/* # WordPress backend materials
  • Disallow: */wp-content/themes/* # WordPress backend materials
  • Disallow: */twenty* # WordPress backend materials
  • Disallow: */santa-claus-xmas-blog/wp-content/themes/default/* # WordPress backend materials
  • Disallow: */function.include* # WordPress backend materials
  • Disallow: */santa-claus-xmas-blog/tag/*
  • Disallow: */category/*
  • Disallow: /santa-claus-xmas-blog/2013/*
  • Disallow: /santa-claus-xmas-blog/2012/*
  • Disallow: /santa-claus-xmas-blog/2011/*
  • Disallow: /santa-claus-xmas-blog/2010/*
  • Disallow: /santa-claus-xmas-blog/2009/*
  • Disallow: /santa-claus-xmas-blog/2008/*
  • Disallow: /santa-claus-xmas-blog/?*
  • Disallow: */zT*
  • Disallow: */santa-claus-xmas-blog/?p=*
  • Allow: */santa-claus-xmas-blog/?p=*&cp=*
  • Disallow: */santa-claus-xmas-blog/author*
  • Disallow: */santa-claus-xmas-blog/page*
  • ScoutJet is IBM Watson's bot. Banned as throws numerous 404s. Nov 2017
  • Sitemap: https://www.emailSanta.com/sitemap-main - color.xml
  • Sitemap: https://www.emailSanta.com/sitemap-main - games.xml
  • Sitemap: https://www.emailSanta.com/sitemap-main - jokes.xml
  • Sitemap: https://www.emailSanta.com/sitemap-main - photos.xml
  • Sitemap: https://www.emailSanta.com/sitemap-main - remainder - 01-13.xml
  • Sitemap: https://www.emailSanta.com/sitemap-main - remainder - 13-26.xml
  • Sitemap: https://www.emailSanta.com/sitemap-main - remainder - 27-34.xml
  • Sitemap: https://www.emailSanta.com/sitemap-main - songs.xml
  • Sitemap: https://www.emailSanta.com/sitemap-blog.xml
  • Last updated Nov 6, 2023 (commented out Mediapartners-Google)
  • Prev updated Oct 26, 2019