mithilatoday.com
robots.txt

Robots Exclusion Standard data for mithilatoday.com

Resource Scan

Scan Details

Site Domain mithilatoday.com
Base Domain mithilatoday.com
Scan Status Ok
Last Scan2025-04-27T21:47:47+00:00
Next Scan 2025-05-04T21:47:47+00:00

Last Scan

Scanned2025-04-27T21:47:47+00:00
URL https://mithilatoday.com/robots.txt
Domain IPs 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001
Response IP 104.21.48.1
Found Yes
Hash c05b3e084522236770c3a0ce7a05606540cb0ad961520413045a6c7ec178fd7f
SimHash 51881b41e611

Groups

*

Rule Path
Allow /ads.txt
Disallow /wp-admin/
Disallow /adblocker
Disallow /?s=
Disallow /search/
Disallow /readme.html

Other Records

Field Value
crawl-delay 5

mediapartners-google

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

googlebot-news

Rule Path
Disallow /sponsored/

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

huggingface

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

mediapartners-google

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

twitterbot

Rule Path
Disallow

nimblecrawler

Rule Path
Disallow /

botrighthere

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

wget

Rule Path
Disallow /

cosmos

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

lexibot

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

builtbottough

Rule Path
Disallow /

backdoorbot/1.0

Rule Path
Disallow /

suzuran

Rule Path
Disallow /

openfind

Rule Path
Disallow /

repomonkey

Rule Path
Disallow /

iron33/1.0.2

Rule Path
Disallow /

getright/4.2

Rule Path
Disallow /

fairad client

Rule Path
Disallow /

gaisbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

searchpreview

Rule Path
Disallow /

callpod keeper

Rule Path
Disallow /

go-http-client

Rule Path
Disallow /

java browser

Rule Path
Disallow /

clickagy intelligence

Rule Path
Disallow /

uipbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.mithilatoday.com/sitemap-news.xml
sitemap https://www.mithilatoday.com/sitemap_index.xml
sitemap https://www.mithilatoday.com/post-sitemap1.xml
sitemap https://www.mithilatoday.com/post-sitemap2.xml
sitemap https://www.mithilatoday.com/category-sitemap.xml
sitemap https://www.mithilatoday.com/post_tag-sitemap.xml
sitemap https://www.mithilatoday.com/page-sitemap.xml

Comments

  • Sitemap