neteron.ru
robots.txt
Robots Exclusion Standard data for neteron.ru
Resource Scan
Scan Details
Site Domain | neteron.ru |
Base Domain | neteron.ru |
Scan Status | Ok |
Last Scan | 2024-11-03T14:30:20+00:00 |
Next Scan | 2024-12-03T14:30:20+00:00 |
Last Scan
Scanned | 2024-11-03T14:30:20+00:00 |
URL | https://neteron.ru/robots.txt |
Domain IPs | 141.8.192.102, 141.8.196.46, 2a0a:2b43:7:9fdd:: |
Response IP | 141.8.192.102 |
Found | Yes |
Hash | f138a82158b4b4d6f1119157f677f7dfe769af9aa23bc06641538ef49f7aa865 |
SimHash | 51818753abf7 |
Groups
*
Rule | Path |
---|---|
Allow | /sitemap.php |
Disallow | /index.php? |
Disallow | /*index.php* |
Disallow | /admin |
Disallow | /api |
Disallow | /applications* |
Disallow | /datastore |
Disallow | /dev |
Disallow | /oauth |
Disallow | /plugins |
Disallow | /system |
Disallow | /uploads |
Disallow | /vendor |
Disallow | /404error.php |
Disallow | /Credits.txt |
Disallow | /error.php |
Disallow | /login/ |
Disallow | /logout |
Disallow | /register |
Disallow | /lostpassword |
Disallow | /privacy |
Disallow | /cookies |
Disallow | /guidelines |
Disallow | /terms |
Disallow | /online |
Disallow | /staff |
Disallow | /contact |
Disallow | /discover/ |
Disallow | /profile/ |
Disallow | /*announcement |
Disallow | /search |
Disallow | /*profile* |
Disallow | /profile |
Disallow | /*login |
Disallow | /rss* |
Disallow | /activity |
Disallow | /new-content |
Disallow | /*promote |
Disallow | /ourpicks |
Disallow | /leaderboard |
Disallow | /pastleaders |
Disallow | /topmembers |
Disallow | /terms* |
Disallow | /*do%3D* |
Disallow | /*sort%3D* |
Disallow | /*sortby%3D* |
Disallow | /*csrf%3D* |
Disallow | /*csrfKey%3D* |
Disallow | */?tab=* |
Disallow | */?_fromLogin=* |
Disallow | */?_fromLogout=* |
Disallow | */submit |
Disallow | */create |
Disallow | */edit |
Disallow | /terms |
Disallow | /*terms* |
Disallow | /?app* |
Disallow | */?app |
Disallow | /discover/ |
Disallow | /discover/* |
Disallow | /forum/applications/* |
Disallow | /forum/discover/* |
Allow | /uploads/monthly_*_*/* |
axmorobot - crawling your site for better indexing on www.axmo.com search engine.
Rule | Path |
---|---|
Disallow | / |
bdncentral crawler v2.3 [en] (http://www.bdncentral.com/robot.html) (x11; i; linux 2.0.44 i686)
Rule | Path |
---|---|
Disallow | / |
bigcliquebot/1.03-dev (bigclicbot; http://www.bigclique.com; bot@bigclique.com)
Rule | Path |
---|---|
Disallow | / |
carnegie_mellon_university_webcrawler,http://www.andrew.cmu.edu/~brgordon/webbot/index.html
Rule | Path |
---|---|
Disallow | / |
carnegie_mellon_university_research_webbot-->please read-->http://www.andrew.cmu.edu/~brgordon/webbot/index.html
Rule | Path |
---|---|
Disallow | / |
creativecommons/0.06-dev (nutch; http://www.nutch.org/docs/en/bot.html; nutch-agent@lists.sourceforge.net)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/2.2.6 (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/2.2.7 (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/2.2.8 (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/2.2.10 (multimedia search) (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/3.3 (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/3.4/nirvana (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/3.4/partnersite (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/3.5 (atw-crawler at fast dot no; http://fast.no/support.php?c=faqs/crawler)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/3.6 (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/3.7 (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/3.8 (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/3.8/fresh (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/3.6/firstpage (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
Rule | Path |
---|---|
Disallow | / |
fast-webcrawler/3.7/firstpage (atw-crawler at fast dot no;http://fast.no/support/crawler.asp)
Rule | Path |
---|---|
Disallow | / |
filangy/0.01-beta (filangy; http://www.nutch.org/docs/en/bot.html; filangy-agent@filangy.com)
Rule | Path |
---|---|
Disallow | / |
filangy/1.0x (filangy; http://www.nutch.org/docs/en/bot.html; filangy-agent@filangy.com)
Rule | Path |
---|---|
Disallow | / |
freefind.com-sitesearchengine/1.0 (http://freefind.com; spiderinfo@freefind.com)
Rule | Path |
---|---|
Disallow | / |
hoowwwer/2.1.0 (+http://cosco.hiit.fi/search/hoowwwer/ | mailto:crawler-info<at>hiit.fi)
Rule | Path |
---|---|
Disallow | / |
iltrovatore-setaccio/0.3-dev (indexing; http://www.iltrovatore.it/bot.html; info@iltrovatore.it)
Rule | Path |
---|---|
Disallow | / |
iltrovatore-setaccio/1.2 (it-bot; http://www.iltrovatore.it/bot.html; info@iltrovatore.it)
Rule | Path |
---|---|
Disallow | / |
iltrovatore-setaccio/1.2-dev (spidering; http://www.iltrovatore.it/aiuto/.....)
Rule | Path |
---|---|
Disallow | / |
incywincy data gatherer(webmaster@loopimprovements.com,http://www.loopimprovements.com/robot.html)
Rule | Path |
---|---|
Disallow | / |
incywincy page crawler(webmaster@loopimprovements.com,http://www.loopimprovements.com/robot.html)
Rule | Path |
---|---|
Disallow | / |
listbidbot (freelance job spider http://listbid.com)<a href=http://listbid.com>freelance</a>
Rule | Path |
---|---|
Disallow | / |
lynx/2.8.4rel.1 libwww-fm/2.14 ssl-mm/1.4.1 openssl/0.9.6c (human-guided@lerly.net)
Rule | Path |
---|---|
Disallow | / |
maxomobot/dev-20051201 (maxomo; http://67.102.134.34:4047/maxomo/maxomobot.html; maxomobot@maxomo.com)
Rule | Path |
---|---|
Disallow | / |
metaspinner/0.01 (metaspinner; http://www.meta-spinner.de/; support@meta-spinner.de/)
Rule | Path |
---|---|
Disallow | / |
microsoftprototypecrawler (how's my crawling? mailto:newbiecrawler@hotmail.com)
Rule | Path |
---|---|
Disallow | / |
nextgensearchbot 1 (for information visit http://www.eliyon.com/nextgensearchbot)
Rule | Path |
---|---|
Disallow | / |
nusearch spider <a href='http://www.nusearch.com'>www.nusearch.com</a> (compatible; msie 4.01)
Rule | Path |
---|---|
Disallow | / |
nutchcvs/0.0x-dev (nutch; http://www.nutch.org/docs/bot.html; nutch-agent@lists.sourceforge.net)
Rule | Path |
---|---|
Disallow | / |
nutchorg/0.0x-dev (nutch; http://www.nutch.org/docs/bot.html; nutch-agent@lists.sourceforge.net)
Rule | Path |
---|---|
Disallow | / |
objectssearch/0.01-dev (objectssearch;http://www.objectssearch.com/bot.html; support@thesoftwareobjects.com)
Rule | Path |
---|---|
Disallow | / |
openbot/3.0+(robot-response@openfind.com.tw;+http://www.openfind.com.tw/robot.html)
Rule | Path |
---|---|
Disallow | / |
openfind data gatherer, openbot/3.0+(robot-response@openfind.com.tw;+http://www.openfind.com.tw/robot.html)
Rule | Path |
---|---|
Disallow | / |
overture-webcrawler/3.8/fresh (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
Rule | Path |
---|---|
Disallow | / |
pipeliner/0.10 (pipeline spider; http://www.pipeline-search.com/webmaster.html)
Rule | Path |
---|---|
Disallow | / |
pipeliner/0.3a (pipeline spider;http://www.pipeline-search.com/webmaster.html; webmaster'at'pipeline-search.com)
Rule | Path |
---|---|
Disallow | / |
schwarzmann.biz-spider_for_paddel.org+(http://www.innerprise.net/usp-spider.asp)
Rule | Path |
---|---|
Disallow | / |
searchbyusa/2 (searchbyusa; http://www.searchbyusa.com/bot.html; info@searchbyusa.com)
Rule | Path |
---|---|
Disallow | / |
searchspider/1.2 (searchspider; http://www.searchspider.com; webmaster@searchspider.com)
Rule | Path |
---|---|
Disallow | / |
sitecheck.internetseer.com (for more info see: http://sitecheck.internetseer.com)
Rule | Path |
---|---|
Disallow | / |
snoopy v1.xx, : user-agent: mozilla/4.0 (compatible; msie 6.0; windows nt 5.1; myie2)
Rule | Path |
---|---|
Disallow | / |
talkro web-shot/1.0 (e-mail: webshot@daumsoft.com, home: http://222.122.15.190/webshot)
Rule | Path |
---|---|
Disallow | / |
tulipchain/5.x (http://ostermiller.org/tulipchain/) java/1.x.1_0x (http://java.sun.com/) linux/2.4.17
Rule | Path |
---|---|
Disallow | / |
tulipchain/5.xx (http://ostermiller.org/tulipchain/) java/1.x.1_0x (http://apple.com/) mac_os_x/10.2.8
Rule | Path |
---|---|
Disallow | / |
unchaos (from chaos to order, hybrid web search engine. (vadim_gonchar@unchaos.com))
Rule | Path |
---|---|
Disallow | / |
unchaosbot (from chaos to order, unchaos hybrid web search engine at www.unchaos.com (info@unchaos.com))
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://neteron.ru/sitemap.php |
Warnings
- 4 invalid lines.