stcwdc.org
robots.txt

Robots Exclusion Standard data for stcwdc.org

Resource Scan

Scan Details

Site Domain stcwdc.org
Base Domain stcwdc.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-08-26T04:26:59+00:00
Next Scan 2025-11-24T04:26:59+00:00

Last Successful Scan

Scanned2025-04-05T20:17:49+00:00
URL https://stcwdc.org/robots.txt
Domain IPs 34.174.210.226
Response IP 34.174.210.226
Found Yes
Hash 79a62660cc099f60ce5cf75e58722742178a67abeecab5ba3fb55a9bae5fddf3
SimHash 4a7d6d935cfa

Groups

dialect

Rule Path
Disallow /

*
googlebot

Rule Path
Disallow /*URL
Disallow /*.shtmlURL
Disallow /y_key_a9c4cb2268248cca.html
Disallow /xd_receiver.htm
Disallow /google04ff9791bf01746a.html
Disallow /204e6f0c744aaa9f2240026805e6bffeed3e8c72.php
Disallow /delorie.htm

*
googlebot

Rule Path
Disallow /wp-content/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-content/update/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-
Disallow /______
Allow /sitemap.xml

*
googlebot

Rule Path
Disallow /cache/
Disallow /dirpas/
Disallow /errors/
Disallow /events/*
Disallow /includes/*
Disallow /mentor/
Disallow /tag/*
Disallow /tags/
Disallow /tag/
Disallow /cgi-bin/
Disallow /index.php/
Disallow /categories/*
Disallow /category/*
Disallow /page/*
Disallow /photos/
Disallow /photos/*
Disallow /php/*
Disallow /search/
Disallow /sets/
Disallow /show/
Disallow /sound/
Disallow /tests/
Disallow /zips/
Disallow /2008/*
Disallow /2009/*
Disallow /2010/*
Disallow /2011/*
Disallow /2012/*
Disallow /buy*
Disallow /canada*
Disallow /cheap*
Disallow /discount*
Disallow /lowest*
Disallow /online*
Disallow /very*
Disallow /viagra*
Disallow *buy*
Disallow *cheap*
Disallow *cialis*
Disallow *discount*
Disallow *drug*
Disallow *fuck*
Disallow *generic*
Disallow *pharm*
Disallow *prescription*
Disallow *viagra*

*
googlebot

Rule Path
Disallow /?author=*
Disallow /?cat=*
Disallow */feed/
Disallow /feed/
Disallow /trackback/
Disallow */trackback*
Disallow */comments
Disallow /*?*
Disallow /*?
Disallow /?stc=*

*
googlebot

Rule Path
Disallow /polls/
Disallow /pollsarchive/
Disallow /useronline/
Disallow /stats/
Disallow /stats*
Disallow /survey/
Disallow /surveys/
Disallow /author
Disallow /author/
Disallow /tag*
Disallow /tags*
Disallow /docs*
Disallow /manual*
Disallow /category/uncategorized*

*
googlebot

Rule Path
Disallow /*.cgi$
Disallow /*.css$
Disallow /*.gz$
Disallow /*.inc*
Disallow /*.inc$
Disallow /*.js$
Disallow /*.php$
Disallow /*.php*
Disallow /*.site$
Disallow /*.rss$
Disallow /*.wmv$
Disallow /*.xsl$

googlebot-image

Rule Path
Allow /*

googlebot-mobile

Rule Path
Allow /*

mediapartners-google

Rule Path
Disallow /*

adsbot-google

Rule Path
Disallow /*

ezooms

Rule Path
Disallow /

markmonitor

Rule Path
Disallow /

static.flickr.com

Rule Path
Allow /*

validator.w3.org

Rule Path
Allow /*

ia_archiver

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

yahoo-slurp

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider-mobile

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-news

Rule Path
Disallow /

baiduspider-favo

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou spider2

Rule Path
Disallow /

sogou blog

Rule Path
Disallow /

sogou news spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

yandex spider

Rule Path
Disallow /

yesspider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://archives.stcwdc.org/sitemap.xml

Comments

  • disallow the following specific files
  • disallow all files in these WordPress directories
  • disallow all files in these directories
  • disallow robots from parsing individual post feeds and trackbacks
  • disallow any files that are stats related
  • disallow files ending with the following extensions
  • disallow WayBack archiving site