cdn.telanganatoday.com
robots.txt

Robots Exclusion Standard data for cdn.telanganatoday.com

Resource Scan

Scan Details

Site Domain cdn.telanganatoday.com
Base Domain telanganatoday.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-07-02T13:35:57+00:00
Next Scan 2024-07-09T13:35:57+00:00
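
The Last Scan and Next Scan timestamps above are exactly seven days apart, which suggests (but does not confirm) a weekly rescan schedule. A minimal Python sketch of that arithmetic follows; the variable names are illustrative and not part of the report.

```python
from datetime import datetime, timedelta

# Timestamps copied from the scan details above (ISO 8601 with a UTC offset).
last_scan = datetime.fromisoformat("2024-07-02T13:35:57+00:00")
next_scan = datetime.fromisoformat("2024-07-09T13:35:57+00:00")

# Gap between the two scans. A weekly schedule is an inference from this
# single pair of values, not a documented property of the scanner.
interval = next_scan - last_scan
print(interval)  # 7 days, 0:00:00
```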

Last Successful Scan

Scanned 2024-06-24T13:35:16+00:00
URL https://cdn.telanganatoday.com/robots.txt
Domain IPs 13.33.88.14, 13.33.88.15, 13.33.88.24, 13.33.88.62
Response IP 13.33.88.24
Found Yes
Hash 75bfd7efcf5bec5e1c84021556131d477727d5545bbb94128937a5ccfe078c59
SimHash e834d9c88dd3
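
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so treat that as an assumption. Under it, a digest of the same shape could be computed as sketched below; `sha256_hex` and `sample_body` are hypothetical names, and the sample bytes are not the real file contents.

```python
import hashlib

def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as 64 hex characters."""
    return hashlib.sha256(body).hexdigest()

# Hypothetical body: the real file's bytes are not reproduced in this report,
# so this digest will NOT equal the Hash value listed above.
sample_body = b"User-agent: *\nAllow: /\n"
digest = sha256_hex(sample_body)
print(len(digest))  # 64
```

Comparing digests from two fetches is a cheap way to detect that the file changed at all. The SimHash field presumably serves the complementary purpose of measuring how similar two versions are, but its exact algorithm is not stated here.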

Groups

*

Rule Path
Allow /
Disallow /wp-admin/
Disallow /?s
Disallow /s?*
Disallow */embed
Disallow /?p=
Disallow /updates/*
Disallow */attachment/*
Disallow */category/*
Disallow */amp
Disallow *utm_*
Disallow */tags/*
Disallow /search/
Disallow /404.html
Disallow *.aspx
Disallow /?s=*
Disallow /telangana-amp/

googlebot

Rule Path
Disallow /?s
Disallow /s?*
Disallow */embed
Disallow /?p=
Disallow /updates/*
Disallow */attachment/*
Disallow */category/*
Disallow */amp
Disallow *utm_*
Disallow */tags/*
Disallow /search/
Disallow /404.html
Disallow *.aspx
Disallow /?s=*
Disallow /telangana-amp/
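
To see how the two groups above behave in practice, the sketch below evaluates a path against them using the matching rules described in RFC 9309: a crawler uses the group that names it (falling back to `*`), the longest matching pattern wins, and Allow wins a tie. Whether any particular crawler, or this scanner, evaluates the rules exactly this way is an assumption. `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical names introduced for illustration.

```python
import re

# Patterns transcribed from the groups above; both groups share these Disallow rules.
SHARED_DISALLOWS = [
    "/?s", "/s?*", "*/embed", "/?p=", "/updates/*", "*/attachment/*",
    "*/category/*", "*/amp", "*utm_*", "*/tags/*", "/search/", "/404.html",
    "*.aspx", "/?s=*", "/telangana-amp/",
]
GROUPS = {
    "*": [("allow", "/"), ("disallow", "/wp-admin/")]
         + [("disallow", p) for p in SHARED_DISALLOWS],
    "googlebot": [("disallow", p) for p in SHARED_DISALLOWS],
}

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a path pattern: '*' matches any run of characters, a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(user_agent: str, path: str) -> bool:
    """Select the group for the agent, then apply the longest matching rule."""
    # Simplification: exact, case-insensitive lookup with "*" as the fallback group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed

print(is_allowed("*", "/wp-admin/options.php"))          # False
print(is_allowed("googlebot", "/wp-admin/options.php"))  # True
```

The second result follows from the group selection step: the googlebot group has no /wp-admin/ rule, and rules from the `*` group are not merged into it.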

Other Records

Field Value
sitemap https://telanganatoday.com/sitemap_index.xml
sitemap https://telanganatoday.com/news-sitemap.xml
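
Sitemap records sit outside the user-agent groups and apply to every crawler. As a rough illustration, Python's standard `urllib.robotparser` can extract them with `site_maps()` (available since Python 3.8). The `robots_lines` list below is an assumed reconstruction of the relevant lines, not the file's exact text.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the relevant lines; the original file's exact
# layout is not shown in this report.
robots_lines = [
    "User-agent: *",
    "Allow: /",
    "Sitemap: https://telanganatoday.com/sitemap_index.xml",
    "Sitemap: https://telanganatoday.com/news-sitemap.xml",
]

parser = RobotFileParser()
parser.parse(robots_lines)

# site_maps() returns a list of sitemap URLs, or None when the file has none.
sitemaps = parser.site_maps()
print(sitemaps)
```

Note that both sitemap URLs point at the base domain telanganatoday.com rather than at the cdn host that served this file.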