gurgaonmoms.com
robots.txt

Robots Exclusion Standard data for gurgaonmoms.com

Resource Scan

Scan Details

Site Domain gurgaonmoms.com
Base Domain gurgaonmoms.com
Scan Status Ok
Last Scan2024-09-08T01:15:08+00:00
Next Scan 2024-10-08T01:15:08+00:00

Last Scan

Scanned2024-09-08T01:15:08+00:00
URL https://gurgaonmoms.com/robots.txt
Domain IPs 104.21.82.158, 172.67.158.244, 2606:4700:3032::ac43:9ef4, 2606:4700:3035::6815:529e
Response IP 104.21.82.158
Found Yes
Hash 0aa8b244321b50bf3d9e91ad486766c67395e1ebc8a08650ff7eab2523508030
SimHash 12f80c32a8f2

Groups

*

Rule Path Comment
Disallow /calendar/action~posterboard/ -
Disallow /calendar/action~agenda/ -
Disallow /calendar/action~oneday/ -
Disallow /calendar/action~month/ -
Disallow /calendar/action~week/ -
Disallow /calendar/action~stream/ -
Disallow /calendar/action~undefined/ -
Disallow /calendar/action~http%3A/ -
Disallow /calendar/action~default/ -
Disallow /calendar/action~poster/ -
Disallow /calendar/action~*/ -
Disallow /*controller%3Dai1ec_exporter_controller* -
Disallow /*/action~*/ -
Disallow /calendar-2/action~posterboard/ -
Disallow /calendar-2/action~agenda/ -
Disallow /calendar-2/action~oneday/ -
Disallow /calendar-2/action~month/ -
Disallow /calendar-2/action~week/ -
Disallow /calendar-2/action~stream/ -
Disallow /calendar-2/action~undefined/ -
Disallow /calendar-2/action~http%3A/ -
Disallow /calendar-2/action~default/ -
Disallow /calendar-2/action~poster/ -
Disallow /calendar-2/action~*/ -
Disallow /wp-admin/ block access to admin section
Disallow /wp-login.php block access to admin section
Disallow /search/ block access to internal search result pages
Disallow *?s=* block access to internal search result pages
Disallow *?p=* block access to pages for which permalinks fails
Disallow *%26p%3D* block access to pages for which permalinks fails
Disallow *%26preview%3D* block access to preview pages
Disallow /tag/ block access to tag pages
Disallow /author/ block access to author pages
Disallow /404-error/ block access to 404 page

mj12bot

Rule Path
Disallow /

mail.ru
dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

sistrix
sistrix crawler
sistrix
seokicks-robot
jobs.de-robot
ahrefsbot

Rule Path
Disallow /

unisterbot
dotbot
searchmetricsbot
surveybot
seodiver
spbot
wotbox
meanpathbot
backlinkcrawler
magpie-crawler
obot
fr-crawler
blexbot
megaindex.ru
megaindex.com
cloudservermarketspider
trendictionbot
exabot
careerbot
lipperhey-kaus-australis
seoscanners.net
metajobbot
spiderbot
linkstats
jobboersebot
iccrawler
plista
domain re-animator bot
turnitinbot
coccoc
um-ic
mindupbot
sg-orbiter
ccbot
qwantify
kraken
plukkie
safednsbot
haosouspider
rogerbot
openhosebot
screaming frog seo spider
thumbsniper
r6_commentreader
implisensebot
cliqzbot
aihitbot
adscanner
crawler4j
wbsearchbot
python/3.5 aiohttp
toweya.com
netestate
bubing
linguee
semrushbot
semrushbot-sa
sentibot
sentibot
velenpublicwebcrawler
domaincrawler
indeedbot
garlikcrawler
gosign-security-crawler
siteliner
sabsimbot
ltx71

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://gurgaonmoms.com/sitemap.xml

Comments

  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
  • Slow down bots
  • Disallow: Sistrix
  • Disallow: SEOkicks-Robot
  • Disallow: jobs.de-Robot
  • Backlink Analysis
  • Bot der Leipziger Unister Holding GmbH
  • http://www.opensiteexplorer.org/dotbot
  • http://www.searchmetrics.com
  • http://www.majestic12.co.uk/projects/dsearch/mj12bot.php
  • http://www.domaintools.com/webmasters/surveybot.php
  • http://www.seodiver.com/bot
  • http://openlinkprofiler.org/bot
  • http://www.wotbox.com/bot/
  • http://www.meanpath.com/meanpathbot.html
  • http://www.backlinktest.com/crawler.html
  • http://www.brandwatch.com/magpie-crawler/
  • http://filterdb.iss.net/crawler/
  • http://webmeup-crawler.com
  • https://megaindex.com/crawler
  • http://www.cloudservermarket.com
  • http://www.trendiction.de/de/publisher/bot
  • http://www.exalead.com
  • http://www.career-x.de/bot.html
  • https://www.lipperhey.com/en/about/
  • https://turnitin.com/robot/crawlerinfo.html
  • http://help.coccoc.com/
  • ubermetrics-technologies.com
  • datenbutler.de
  • http://searchgears.de/uber-uns/crawling-faq.html
  • http://commoncrawl.org/faq/
  • https://www.qwant.com/
  • http://linkfluence.net/
  • http://www.botje.com/plukkie.htm
  • https://www.safedns.com/searchbot
  • http://www.haosou.com/help/help_3_2.html
  • http://www.moz.com/dp/rogerbot
  • http://www.openhose.org/bot.html
  • http://www.screamingfrog.co.uk/seo-spider/
  • http://thumbsniper.com
  • http://www.radian6.com/crawler
  • http://cliqz.com/company/cliqzbot
  • https://www.aihitdata.com/about
  • http://www.trendiction.com/en/publisher/bot
  • http://seocompany.store
  • https://github.com/yasserg/crawler4j/
  • http://warebay.com/bot.html
  • http://www.website-datenbank.de/
  • http://law.di.unimi.it/BUbiNG.html
  • http://www.linguee.com/bot; bot@linguee.com
  • https://www.semrush.com/bot/
  • www.sentibot.eu
  • http://velen.io
  • https://moz.com/help/guides/moz-procedures/what-is-rogerbot
  • http://www.garlik.com
  • https://www.gosign.de/typo3-extension/typo3-sicherheitsmonitor/
  • http://www.siteliner.com/bot
  • https://sabsim.com
  • http://ltx71.com/
  • END

Warnings

  • 1 invalid line.