probharat.com
robots.txt

Robots Exclusion Standard data for probharat.com

Resource Scan

Scan Details

Site Domain probharat.com
Base Domain probharat.com
Scan Status Ok
Last Scan2024-09-21T18:47:31+00:00
Next Scan 2024-09-28T18:47:31+00:00

Last Scan

Scanned2024-09-21T18:47:31+00:00
URL https://probharat.com/robots.txt
Domain IPs 131.153.231.79
Response IP 131.153.231.79
Found Yes
Hash 77844f89d52e2a59f6e8048bc55d0be2648fd325504a99959aadce4fda7d7769
SimHash 84747b0081b3

Groups

mediapartners-google

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

crystalsemanticsbot

Rule Path
Disallow /

discoverybot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

moget

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

paperlibot

Rule Path
Disallow /

scooperbot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

spinn3r

Rule Path
Disallow /
Allow /news/

yandex

Rule Path
Disallow /

yeti

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /news/
Disallow /hindi-news/
Disallow /indian-calendars/date.php
Disallow /astrology/panchangam.php
Disallow /news/update-news-sampurn.php
Disallow /news/logintest/
Disallow /movies/update-photo-sampurn.php
Disallow /movies/show-comments.php
Disallow /manage/
Disallow /news/manage.php
Disallow /movies/manage.php
Disallow /india-old/
Disallow /account/

Warnings

  • 2 invalid lines.