cdn.telanganatoday.com
robots.txt
Robots Exclusion Standard data for cdn.telanganatoday.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | cdn.telanganatoday.com |
Base Domain | telanganatoday.com |
Scan Status | Failed |
Failure Stage | Fetching resource |
Failure Reason | Server returned a client error (HTTP 4xx) |
Last Scan | 2024-07-02T13:35:57+00:00 |
Next Scan | 2024-07-09T13:35:57+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-06-24T13:35:16+00:00 |
URL | https://cdn.telanganatoday.com/robots.txt |
Domain IPs | 13.33.88.14, 13.33.88.15, 13.33.88.24, 13.33.88.62 |
Response IP | 13.33.88.24 |
Found | Yes |
Hash | 75bfd7efcf5bec5e1c84021556131d477727d5545bbb94128937a5ccfe078c59 |
SimHash | e834d9c88dd3 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | /wp-admin/ |
Disallow | /?s |
Disallow | /s?* |
Disallow | */embed |
Disallow | /?p= |
Disallow | /updates/* |
Disallow | */attachment/* |
Disallow | */category/* |
Disallow | */amp |
Disallow | *utm_* |
Disallow | */tags/* |
Disallow | /search/ |
Disallow | /404.html |
Disallow | *.aspx |
Disallow | /?s=* |
Disallow | /telangana-amp/ |
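
The Allow and Disallow rows above only decide anything once a crawler matches a URL path against them. The following is a minimal sketch of that matching, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) and `*` as a wildcard for any run of characters. The helper names `pattern_to_regex` and `is_allowed` are hypothetical and are not part of the scan output; Python's standard `urllib.robotparser` is not used here because it does not expand `*` wildcards.

```python
import re

# Rules copied from the table above, in the listed order.
RULES = [
    ("Allow", "/"),
    ("Disallow", "/wp-admin/"),
    ("Disallow", "/?s"),
    ("Disallow", "/s?*"),
    ("Disallow", "*/embed"),
    ("Disallow", "/?p="),
    ("Disallow", "/updates/*"),
    ("Disallow", "*/attachment/*"),
    ("Disallow", "*/category/*"),
    ("Disallow", "*/amp"),
    ("Disallow", "*utm_*"),
    ("Disallow", "*/tags/*"),
    ("Disallow", "/search/"),
    ("Disallow", "/404.html"),
    ("Disallow", "*.aspx"),
    ("Disallow", "/?s=*"),
    ("Disallow", "/telangana-amp/"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` (path plus query string) may be crawled.

    Assumption: the longest matching pattern wins, and Allow wins ties.
    A path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "Allow"
            if len(pattern) > best_len or (
                len(pattern) == best_len and allow and not best_allow
            ):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Because patterns are prefix matches unless they end in `$`, a rule such as `*/amp` also covers any path that merely begins with `/amp`, for example `/amplifier-news`. Under the same assumptions, `is_allowed("/hyderabad/some-story")` is true, while `is_allowed("/some-story/amp")` and `is_allowed("/?s=cricket")` are false.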
Other Records
Field | Value |
---|---|
sitemap | https://telanganatoday.com/sitemap_index.xml |
sitemap | https://telanganatoday.com/news-sitemap.xml |
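
For reference, the parsed group and records can be rendered back into robots.txt syntax. The sketch below is an approximation only: the function name `render_robots_txt` is hypothetical, and the comments, blank lines, field casing, and line endings of the original file are not recoverable from this report, so the output should not be expected to reproduce the Hash value listed under Last Successful Scan.

```python
# Values copied from the Groups and Other Records tables in this report.
GROUP_AGENT = "*"
RULES = [
    ("Allow", "/"),
    ("Disallow", "/wp-admin/"),
    ("Disallow", "/?s"),
    ("Disallow", "/s?*"),
    ("Disallow", "*/embed"),
    ("Disallow", "/?p="),
    ("Disallow", "/updates/*"),
    ("Disallow", "*/attachment/*"),
    ("Disallow", "*/category/*"),
    ("Disallow", "*/amp"),
    ("Disallow", "*utm_*"),
    ("Disallow", "*/tags/*"),
    ("Disallow", "/search/"),
    ("Disallow", "/404.html"),
    ("Disallow", "*.aspx"),
    ("Disallow", "/?s=*"),
    ("Disallow", "/telangana-amp/"),
]
SITEMAPS = [
    "https://telanganatoday.com/sitemap_index.xml",
    "https://telanganatoday.com/news-sitemap.xml",
]


def render_robots_txt(agent: str, rules, sitemaps) -> str:
    """Render one user-agent group plus sitemap records as robots.txt text."""
    lines = [f"User-agent: {agent}"]
    lines += [f"{kind}: {path}" for kind, path in rules]
    lines.append("")  # blank line between the group and the sitemap records
    lines += [f"Sitemap: {url}" for url in sitemaps]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots_txt(GROUP_AGENT, RULES, SITEMAPS), end="")
```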