pijushsaha.com
robots.txt
Robots Exclusion Standard data for pijushsaha.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | pijushsaha.com |
Base Domain | pijushsaha.com |
Scan Status | Ok |
Last Scan | 2025-09-10T06:54:29+00:00 |
Next Scan | 2025-10-10T06:54:29+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-09-10T06:54:29+00:00 |
URL | https://pijushsaha.com/robots.txt |
Domain IPs | 104.21.19.100, 172.67.185.190, 2606:4700:3031::ac43:b9be, 2606:4700:3033::6815:1364 |
Response IP | 104.21.19.100 |
Found | Yes |
Hash | da44eb2b61d411a5515ad353de56eedf9e4c88c36e42b12675e0e248211b78e5 |
SimHash | 60109af18277 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | */?replytocom= |
Disallow | */?partial-prev= |
Disallow | */feed |
Disallow | */?s |
Disallow | */?id |
Disallow | */embed |
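The rules in this group use the `*` wildcard, so it may help to see how a crawler could evaluate them. The following is a minimal sketch, assuming the matching behavior described in RFC 9309 (a pattern matches from the start of the path, `*` matches any run of characters, the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and chosen for illustration; real crawlers may differ in detail.

```python
import re

# Rules copied from the "*" group in the table above, in listed order.
RULES = [
    ("allow", "/"),
    ("disallow", "*/?replytocom="),
    ("disallow", "*/?partial-prev="),
    ("disallow", "*/feed"),
    ("disallow", "*/?s"),
    ("disallow", "*/?id"),
    ("disallow", "*/embed"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumed precedence: the longest matching pattern wins, Allow wins
    a tie, and a path that matches no rule is allowed.
    """
    best_len, allowed = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/about/")` is true because only `Allow: /` matches, while `is_allowed("/blog/post/feed")` and `is_allowed("/?s=python")` are false. Because patterns are prefix matches, `*/feed` would also block a path such as `/feedback` in this sketch.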
Other Records
Field | Value |
---|---|
sitemap | https://pijushsaha.com/sitemap_index.xml |
sitemap | https://pijushsaha.com/post-sitemap.xml |
sitemap | https://pijushsaha.com/author-sitemap.xml |
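For readers who want to work with these records programmatically, here is a small sketch that reads the sitemap entries with Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is a reconstruction from the tables on this page, not the original file bytes, so its formatting (and therefore its hash) is an assumption; `list_sitemaps` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records tables above.
# Assumption: the real file may differ in whitespace, order, or comments.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Disallow: */?replytocom=
Disallow: */?partial-prev=
Disallow: */feed
Disallow: */?s
Disallow: */?id
Disallow: */embed

Sitemap: https://pijushsaha.com/sitemap_index.xml
Sitemap: https://pijushsaha.com/post-sitemap.xml
Sitemap: https://pijushsaha.com/author-sitemap.xml
"""


def list_sitemaps(text):
    """Return the Sitemap URLs declared in a robots.txt body.

    RobotFileParser.site_maps() (Python 3.8+) returns None when no
    sitemap is declared, so that case is normalized to an empty list.
    """
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser.site_maps() or []


sitemaps = list_sitemaps(ROBOTS_TXT)
for url in sitemaps:
    print(url)
```

Only `site_maps()` is used here because the standard-library parser does not interpret `*` wildcards in paths, so its `can_fetch()` results for the rules in this file should not be relied on.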