en.prothomalo.com
robots.txt

Robots Exclusion Standard data for en.prothomalo.com

Resource Scan

Scan Details

Site Domain en.prothomalo.com
Base Domain prothomalo.com
Scan Status Ok
Last Scan 2024-05-23T12:49:39+00:00
Next Scan 2024-06-06T12:49:39+00:00

Last Scan

Scanned 2024-05-23T12:49:39+00:00
URL https://en.prothomalo.com/robots.txt
Domain IPs 104.17.144.114, 104.17.145.114, 2606:4700::6811:9072, 2606:4700::6811:9172
Response IP 104.17.145.114
Found Yes
Hash 65c95965957c30b594ee266581cf0bb25e42e95ad0cb8dc6e2b33819d40d2d47
SimHash 82049a02e713
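
The report does not name the algorithm behind the Hash field. Its length (64 hexadecimal characters) is consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. Both the algorithm and the exact bytes that were hashed are assumptions, and the helper names are hypothetical.

```python
import hashlib

# Hash value as listed in the scan report above.
REPORTED_HASH = "65c95965957c30b594ee266581cf0bb25e42e95ad0cb8dc6e2b33819d40d2d47"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a freshly fetched robots.txt body against the reported hash.

    Assumption: the scanner hashed the unmodified response body with SHA-256.
    A mismatch may mean the file changed or that this assumption is wrong.
    """
    return sha256_hex(body) == reported.lower()
```

To use it, fetch https://en.prothomalo.com/robots.txt with any HTTP client and pass the raw response bytes to matches_reported.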

Groups

*

Rule Path
Allow /
Disallow /shell.html
Disallow /api/auth/
Disallow /api/comments/get_comments_json
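
To illustrate how a crawler might interpret the group above, here is a minimal sketch of longest-match evaluation as described in RFC 9309: the most specific (longest) matching path wins, and Allow wins a tie. It treats paths as plain prefixes and ignores the * and $ wildcards, which this group does not use. The function name and the sample paths in the usage note are hypothetical.

```python
# Rules from the "*" group in the report, as (directive, path) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/shell.html"),
    ("disallow", "/api/auth/"),
    ("disallow", "/api/comments/get_comments_json"),
]


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if the path may be crawled under longest-match rules.

    Simplification: rule paths are matched as plain prefixes, with no
    wildcard or percent-encoding handling.
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed by default
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and directive == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = directive == "allow"
    return allowed
```

Under these rules, is_allowed("/shell.html") and is_allowed("/api/auth/login") are False, while a path such as "/api/comments" remains allowed because only the longer get_comments_json path is disallowed. Parsers that apply rules in file order instead of by length (first match wins) would let the leading Allow / override every Disallow that follows it.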

Other Records

Field Value
sitemap https://en.prothomalo.com/sitemap.xml
sitemap https://en.prothomalo.com/news_sitemap.xml
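
The sitemap records can be read from the file with a simple line-based parse. The sketch below runs against a hypothetical reconstruction of the robots.txt built from the records in this report; the original file's exact casing, ordering, and whitespace are not shown here, so the text is an assumption, as is the function name.

```python
# Hypothetical reconstruction of the file from the records in this report.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Disallow: /shell.html
Disallow: /api/auth/
Disallow: /api/comments/get_comments_json

Sitemap: https://en.prothomalo.com/sitemap.xml
Sitemap: https://en.prothomalo.com/news_sitemap.xml
"""


def extract_sitemaps(text: str) -> list[str]:
    """Collect the values of all 'sitemap' records, in file order."""
    sitemaps = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if ":" not in line:
            continue
        # Split on the first colon only, so the URL's own colons survive.
        field, value = line.split(":", 1)
        if field.strip().lower() == "sitemap":
            sitemaps.append(value.strip())
    return sitemaps
```

Sitemap records are independent of user-agent groups, which is why the report lists them separately under Other Records.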