telanganatoday.com
robots.txt
Robots Exclusion Standard data for telanganatoday.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | telanganatoday.com |
Base Domain | telanganatoday.com |
Scan Status | Ok |
Last Scan | 2024-05-11T06:42:20+00:00 |
Next Scan | 2024-05-18T06:42:20+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-11T06:42:20+00:00 |
URL | https://telanganatoday.com/robots.txt |
Domain IPs | 54.192.18.107, 54.192.18.25, 54.192.18.48, 54.192.18.63 |
Response IP | 13.33.88.14 |
Found | Yes |
Hash | 75bfd7efcf5bec5e1c84021556131d477727d5545bbb94128937a5ccfe078c59 |
SimHash | e834d9c88dd3 |
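The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the scan does not say which algorithm or which bytes were hashed. As a rough sketch under that assumption, a digest of a fetched robots.txt body could be computed as follows. The function name `content_hash` is hypothetical, and this is not guaranteed to reproduce the value listed in the table.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Placeholder body for illustration, not the real file contents.
    sample = b"User-agent: *\nAllow: /\n"
    print(content_hash(sample))
```

Comparing digests between two scans is a cheap way to detect that the file changed at all; a similarity hash such as the SimHash field would be needed to judge how much it changed.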
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | /wp-admin/ |
Disallow | /?s |
Disallow | /s?* |
Disallow | */embed |
Disallow | /?p= |
Disallow | /updates/* |
Disallow | */attachment/* |
Disallow | */category/* |
Disallow | */amp |
Disallow | *utm_* |
Disallow | */tags/* |
Disallow | /search/ |
Disallow | /404.html |
Disallow | *.aspx |
Disallow | /?s=* |
Disallow | /telangana-amp/ |
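To see how the rules in this group interact, the sketch below evaluates a path against them using the precedence described in RFC 9309: the longest matching pattern wins, and Allow wins a tie. It is a simplified illustration, not the matcher any particular crawler uses. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and the example paths are invented.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/?s"),
    ("disallow", "/s?*"),
    ("disallow", "*/embed"),
    ("disallow", "/?p="),
    ("disallow", "/updates/*"),
    ("disallow", "*/attachment/*"),
    ("disallow", "*/category/*"),
    ("disallow", "*/amp"),
    ("disallow", "*utm_*"),
    ("disallow", "*/tags/*"),
    ("disallow", "/search/"),
    ("disallow", "/404.html"),
    ("disallow", "*.aspx"),
    ("disallow", "/?s=*"),
    ("disallow", "/telangana-amp/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/wp-admin/options.php", "/?s=hyderabad",
              "/news/story?utm_source=x", "/telangana/some-article"]:
        print(p, is_allowed(p))
```

Because `Allow: /` is only one character long, every Disallow pattern that matches a path is more specific and overrides it. The Allow rule therefore only affects paths that no Disallow pattern matches, which would be allowed by default anyway.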
Other Records
Field | Value |
---|---|
sitemap | https://telanganatoday.com/sitemap_index.xml |
sitemap | https://telanganatoday.com/news-sitemap.xml |
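Sitemap records sit outside any user-agent group, so a consumer can collect them with a simple line scan. The following is a minimal sketch, assuming the file uses the usual `Field: value` syntax. The function name `extract_sitemaps` is hypothetical, and `SAMPLE` is a shortened reconstruction based on the tables above rather than the verbatim file.

```python
def extract_sitemaps(robots_txt):
    """Return the URLs of all Sitemap records in a robots.txt body."""
    urls = []
    for line in robots_txt.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        # Split on the first colon only, so the URL's own "://" survives.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


# Shortened reconstruction for illustration, not the verbatim file.
SAMPLE = """User-agent: *
Allow: /
Disallow: /wp-admin/
Sitemap: https://telanganatoday.com/sitemap_index.xml
Sitemap: https://telanganatoday.com/news-sitemap.xml
"""

if __name__ == "__main__":
    for url in extract_sitemaps(SAMPLE):
        print(url)
```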