sport.bigmir.net
robots.txt

Robots Exclusion Standard data for sport.bigmir.net

Resource Scan

Scan Details

Site Domain sport.bigmir.net
Base Domain bigmir.net
Scan Status Ok
Last Scan2025-03-23T16:39:18+00:00
Next Scan 2025-04-22T16:39:18+00:00

Last Scan

Scanned2025-03-23T16:39:18+00:00
URL https://sport.bigmir.net/robots.txt
Domain IPs 3.160.212.53, 3.160.212.72, 3.160.212.81, 3.160.212.89
Response IP 3.164.68.2
Found Yes
Hash c23e41d770a65bfab72b64053d790933e8048ae966ced912bbf7bd0bd6608884
SimHash 0c049822a711

Groups

*

Rule Path
Disallow *tplmode%3Dprint
Disallow *tpl%3Dr_news_popular
Disallow */signin
Disallow */login
Disallow */share
Disallow */search
Disallow /click

slurp

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

nutch

Rule Path
Disallow /

psbot

Rule Path
Disallow /

asterias

Rule Path
Disallow /

Other Records

Field Value
sitemap https://sport.bigmir.net/sitemap.xml
sitemap https://sport.bigmir.net/sitemap.news.xml

Warnings

  • `host` is not a known field.