pijushsaha.com
robots.txt

Robots Exclusion Standard data for pijushsaha.com

Resource Scan

Scan Details

Site Domain pijushsaha.com
Base Domain pijushsaha.com
Scan Status Ok
Last Scan2025-09-10T06:54:29+00:00
Next Scan 2025-10-10T06:54:29+00:00

Last Scan

Scanned2025-09-10T06:54:29+00:00
URL https://pijushsaha.com/robots.txt
Domain IPs 104.21.19.100, 172.67.185.190, 2606:4700:3031::ac43:b9be, 2606:4700:3033::6815:1364
Response IP 104.21.19.100
Found Yes
Hash da44eb2b61d411a5515ad353de56eedf9e4c88c36e42b12675e0e248211b78e5
SimHash 60109af18277

Groups

*

Rule Path
Allow /
Disallow */?replytocom=
Disallow */?partial-prev=
Disallow */feed
Disallow */?s
Disallow */?id
Disallow */embed

feedfetcher-google

Rule Path
Allow /*feed

yandexturbo/1.0

Rule Path
Allow /*feed

bingbot

Rule Path
Allow /*feed

Other Records

Field Value
sitemap https://pijushsaha.com/sitemap_index.xml
sitemap https://pijushsaha.com/post-sitemap.xml
sitemap https://pijushsaha.com/author-sitemap.xml