Robots Exclusion Standard data for saidulhassan.com

Resource Scan

Scan Details

Site Domain saidulhassan.com
Base Domain saidulhassan.com
Scan Status Ok
Last Scan 2024-10-29T15:27:55+00:00
Next Scan 2024-11-28T15:27:55+00:00

Last Scan

Scanned 2024-10-29T15:27:55+00:00
URL https://saidulhassan.com/robots.txt
Domain IPs 63.250.38.48
Response IP 63.250.38.48
Found Yes
Hash d5b349e0c9c644c31ef6cb11cb8974c355db32818bc359957f920dddbc8bccfe
SimHash 0c44cb808212

Groups

*

Rule Path
Disallow /blog/wp-admin/
Disallow /category/*/*
Disallow */trackback
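
Two of the three paths in the * group use the `*` wildcard. As a rough illustration of how such patterns are usually evaluated, here is a minimal Python sketch. It assumes RFC 9309 style semantics (`*` matches any run of characters, a trailing `$` anchors the end of the path, everything else is a prefix match). The helper names `robots_pattern_to_regex` and `is_disallowed` are hypothetical, and individual crawlers may interpret these rules differently.

```python
import re
from typing import Sequence

def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters, a trailing '$'
    anchors the end of the path, and the rest is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str, patterns: Sequence[str]) -> bool:
    """Return True if any pattern matches the path from its first character."""
    return any(robots_pattern_to_regex(p).match(path) for p in patterns)

# Disallow paths listed for the '*' group in this scan.
STAR_GROUP_DISALLOW = ["/blog/wp-admin/", "/category/*/*", "*/trackback"]
```

Under these assumptions, `is_disallowed("/category/seo/page/2", STAR_GROUP_DISALLOW)` is true, while `/category/seo` is not matched because the pattern requires a second `/` after `/category/`.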

baiduspider
baiduspider-ads
baiduspider-cpro
baiduspider-favo
baiduspider-news
baiduspider-video
baiduspider-image
yandex

Rule Path
Disallow /

Other Records

Field Value
sitemap http://saidulhassan.com/sitemap.xml

Comments

  • For Behaving Bots
  • Bad Spiders
  • Sitemaps
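
The scan lists the parsed groups, the sitemap record and the comments, but not the raw file. The sketch below reassembles a plausible robots.txt from those pieces and checks it with Python's standard `urllib.robotparser`. The line order, the capitalization of field names and the placement of the three comments are assumptions, and `ExampleBot` is a hypothetical user agent. The standard-library parser treats `*` inside a path literally, so only the non-wildcard rules are exercised here.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction from the scan data; layout and comment placement are assumed.
RECONSTRUCTED_ROBOTS_TXT = """\
# For Behaving Bots
User-agent: *
Disallow: /blog/wp-admin/
Disallow: /category/*/*
Disallow: */trackback

# Bad Spiders
User-agent: baiduspider
User-agent: baiduspider-ads
User-agent: baiduspider-cpro
User-agent: baiduspider-favo
User-agent: baiduspider-news
User-agent: baiduspider-video
User-agent: baiduspider-image
User-agent: yandex
Disallow: /

# Sitemaps
Sitemap: http://saidulhassan.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(RECONSTRUCTED_ROBOTS_TXT.splitlines())

# Agents named in the second group are blocked from the whole site.
baidu_allowed = parser.can_fetch("baiduspider", "https://saidulhassan.com/")
yandex_allowed = parser.can_fetch("yandex", "https://saidulhassan.com/about")

# Any other agent falls back to the '*' group.
admin_allowed = parser.can_fetch("ExampleBot", "https://saidulhassan.com/blog/wp-admin/")
home_allowed = parser.can_fetch("ExampleBot", "https://saidulhassan.com/")

# site_maps() requires Python 3.8 or later.
sitemaps = parser.site_maps()
```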