nhl-news.ru
robots.txt

Robots Exclusion Standard data for nhl-news.ru

Resource Scan

Scan Details

Site Domain nhl-news.ru
Base Domain nhl-news.ru
Scan Status Ok
Last Scan 2024-09-30T13:08:41+00:00
Next Scan 2024-10-07T13:08:41+00:00

Last Scan

Scanned 2024-09-30T13:08:41+00:00
URL https://nhl-news.ru/robots.txt
Domain IPs 104.21.65.161, 172.67.147.38, 2606:4700:3032::ac43:9326, 2606:4700:3033::6815:41a1
Response IP 172.67.147.38
Found Yes
Hash 10e4519de8d598faf89eead86ed88f8ff859dfd18c7d4014ad867b34c0f7f8a2
SimHash e996050babf0

Groups

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

crazywebcrawler-spider

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

spbot

Rule Path
Disallow /

crawl

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

msie*

Rule Path
Disallow /

seznambot*

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /
Allow /content/*
Allow /caphits/*
Allow /all_about_team/*
Allow /highlights
Allow /all-*
Allow /all_about_team
Allow /broadcast
Allow /transfery
Allow /fights
Allow /blog/*
Allow /tagsforvideotax/matchi-v-zapisi
Allow /statistika
Allow /numbers
Allow /farms_ahl

Other Records

Field Value
crawl-delay 900
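
The mail.ru group combines a blanket `Disallow /` with a list of `Allow` exceptions. How a crawler might resolve them can be sketched as below. This is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie); the report does not say whether Mail.ru's crawler follows that precedence. The names `pattern_to_regex` and `is_allowed` are hypothetical, and only a subset of the group's rules is included.

```python
import re

# Subset of the mail.ru group listed above: (rule, path pattern).
RULES = [
    ("disallow", "/"),
    ("allow", "/content/*"),
    ("allow", "/highlights"),
    ("allow", "/blog/*"),
    ("allow", "/statistika"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be fetched under longest-match precedence."""
    best_len, best_rule = -1, "allow"  # no matching rule means allowed
    for rule, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # The longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and rule == "allow"):
                best_len, best_rule = length, rule
    return best_rule == "allow"


# Example paths are illustrative only, not taken from the site.
for path in ("/highlights", "/content/example", "/video/example"):
    print(path, is_allowed(path))
```

Under these assumptions the first two paths are allowed because a longer `Allow` pattern outranks `Disallow /`, while the third falls back to the blanket disallow.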

*

Rule Path
Allow /sitemap.xml
Allow /tagsforvideotax/supermatchi
Allow /tagsforvideotax/matchi-v-zapisi
Allow /tags/het-trik-gordi-hou
Allow /tags/oceni-treyd
Disallow /tag/
Disallow /*/tag/*
Disallow /tags/
Disallow /*/tags/*
Disallow /taxonomy/
Disallow /*/taxonomy/*
Disallow /term/
Disallow /includes/
Disallow /misc/
Disallow /modules/
Disallow /profiles/
Disallow /scripts/
Disallow /themes/
Disallow /CHANGELOG.txt
Disallow /cron.php
Disallow /INSTALL.mysql.txt
Disallow /INSTALL.pgsql.txt
Disallow /INSTALL.sqlite.txt
Disallow /install.php
Disallow /INSTALL.txt
Disallow /LICENSE.txt
Disallow /MAINTAINERS.txt
Disallow /update.php
Disallow /UPGRADE.txt
Disallow /xmlrpc.php
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips/
Disallow /node/add/
Disallow /search/
Disallow /user/
Disallow /user/register/
Disallow /user/password/
Disallow /user/login/
Disallow /user/logout/
Disallow /tag/
Disallow /tags/
Disallow /taxonomy/
Disallow /term/
Disallow /?q=admin%2F
Disallow /?q=comment%2Freply%2F
Disallow /?q=filter%2Ftips%2F
Disallow /?q=node%2Fadd%2F
Disallow /?q=search%2F
Disallow /?q=user%2Fpassword%2F
Disallow /?q=user%2Fregister%2F
Disallow /?q=user%2Flogin%2F
Disallow /?q=user%2Flogout%2F
Disallow /admin
Disallow /comment
Disallow /filter/tips
Disallow /node/add
Disallow /search
Disallow /user
Disallow /user/register
Disallow /user/password
Disallow /user/login
Disallow /user/logout
Disallow /tag
Disallow /tags
Disallow /taxonomy
Disallow /term
Disallow /%D*
Disallow /advstats?*
Disallow /tablenhl?*
Disallow /*?field*
Disallow /*page%3D*
Disallow /?page=*
Disallow /%26page*
Disallow /*?destination*
Disallow /?qt-maintabs
Disallow /*?option*
Disallow /*?tid*
Disallow /aggregator*
Disallow /?do*
Disallow /*rate%3D*
Disallow /node/*
Disallow /video/*
Disallow /Novosti-NHL/*
Disallow /Novosti-NHL
Disallow /novosti-NHL*
Disallow /dlya-igr/*
Disallow /games/*
Disallow /forstats/*
Disallow /*?feed*
Disallow /*?iframe*
Disallow /*?from*
Disallow /views/ajax*
Disallow /*?qt*
Disallow /contact*
Disallow /*?date*
Disallow /*?title*
Disallow /statistika/*
Disallow /*m%3D*
Disallow /transfery/*
Disallow /*?order=title&sort=desc
Disallow /*?order*
Disallow /*?cid=*
Disallow /*image_captcha*
Disallow /*?yili*
Disallow /*?m=*
Disallow /*?author=*
Disallow /sostavy-zvenev1/*
Disallow /caphits/free-agency
Disallow /Highlights
Disallow /?q=admin
Disallow /?q=comment
Disallow /?q=filter%2Ftips
Disallow /?q=node%2Fadd
Disallow /?q=search
Disallow /?q=user%2Fpassword
Disallow /?q=user%2Fregister
Disallow /?q=user%2Flogin
Disallow /?q=user%2Flogout

Other Records

Field Value
crawl-delay 100
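
`crawl-delay` is not part of RFC 9309, and crawlers that honor it commonly read the value as a number of seconds to wait between requests. Assuming that interpretation, the sketch below works out the upper bound on requests per day implied by the two values in this report (900 for the mail.ru group, 100 for `*`). The function name `max_requests_per_day` is hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86400


def max_requests_per_day(crawl_delay_seconds: int) -> int:
    """Upper bound on requests per day if one request is sent per delay interval."""
    return SECONDS_PER_DAY // crawl_delay_seconds


# Values taken from the crawl-delay records in this report.
CRAWL_DELAYS = {"mail.ru": 900, "*": 100}

for agent, delay in CRAWL_DELAYS.items():
    print(agent, max_requests_per_day(delay))
```

Read this way, a 900-second delay caps a compliant crawler at 96 requests per day and a 100-second delay at 864.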

Other Records

Field Value
sitemap https://nhl-news.ru/sitemap.xml

Comments

  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/robotstxt.html
  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • Paths (clean URLs) - after the fix!
  • Paths (no clean URLs) - after the fix!
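
The comments above note that robots.txt is only honored at the root of the host. A minimal sketch of deriving that location from any page URL, using Python's standard `urllib.parse`; the function name `robots_url` and the example page path are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the root-level robots.txt URL for the host serving `page_url`."""
    parts = urlsplit(page_url)
    # Keep scheme and host; replace the path and drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# The page path below is illustrative only.
print(robots_url("https://nhl-news.ru/blog/example?page=2"))
print(robots_url("http://example.com/site/robots.txt"))
```

Both calls resolve to the host root, which matches the "Used" and "Ignored" cases in the comments: a file at `/site/robots.txt` is never the one a crawler consults.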

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.
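
Both flagged fields are extensions documented by Yandex rather than part of RFC 9309, which is presumably why the scanner does not recognize them. The sketch below shows one way such warnings could be produced. It assumes a known-field set of `user-agent`, `allow`, `disallow`, `sitemap`, and `crawl-delay`; the scanner's actual list is not given in this report. The function name `unknown_fields` is hypothetical, and the `Clean-param` and `Host` values in the sample are made up for illustration, since the report does not show them.

```python
# Assumed set of recognized fields; the scanner's real list is not published here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_fields(robots_txt: str) -> list[str]:
    """Return unrecognized field names in order of first appearance."""
    seen: list[str] = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen


# Hypothetical excerpt; the Clean-param and Host values are illustrative.
SAMPLE = """\
User-agent: *
Disallow: /admin/
Crawl-delay: 100
Clean-param: page /blog/
Host: nhl-news.ru
Sitemap: https://nhl-news.ru/sitemap.xml
"""

for field in unknown_fields(SAMPLE):
    print(f"`{field}` is not a known field.")
```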