
Robots Exclusion Standard data for abc.com

Resource Scan

Scan Details

Site Domain abc.com
Base Domain abc.com
Scan Status Ok
Last Scan 2024-06-20T21:45:02+00:00
Next Scan 2024-06-27T21:45:02+00:00

Last Scan

Scanned 2024-06-20T21:45:02+00:00
URL https://abc.com/robots.txt
Domain IPs 13.35.18.104, 13.35.18.79, 13.35.18.80, 13.35.18.94
Response IP 13.35.18.79
Found Yes
Hash c87b3da907ff67e998be9000c19b8858b5dd9ac3263076c42c3ea0a1b6fcf0c3
SimHash 0904180867b0
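
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not name the algorithm or say exactly which bytes were hashed, so the following is only a sketch under that assumption; `content_fingerprint` and `changed_since_scan` are hypothetical helper names, and the result may not reproduce the reported value if the scanner normalizes the body first.

```python
import hashlib

# Hash value copied from the scan details above.
REPORTED_HASH = "c87b3da907ff67e998be9000c19b8858b5dd9ac3263076c42c3ea0a1b6fcf0c3"


def content_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def changed_since_scan(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Report whether a freshly fetched body differs from the scanned one."""
    return content_fingerprint(body) != reported.lower()


if __name__ == "__main__":
    # Illustrative body only; this is not the real abc.com file.
    sample = b"User-agent: *\nDisallow: /rss/\n"
    print(content_fingerprint(sample))
    print(changed_since_scan(sample))
```

Comparing digests this way only detects that the file changed, not what changed; a similarity hash such as the SimHash field is the usual tool for judging how large the change was.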

Groups

*

Rule Path
Disallow /rss/
Disallow /xml/
Disallow /json/
Disallow /headerxml/
Disallow /service/
Disallow /util/
Disallow /vp2/
Disallow /embed/
Disallow /html/
Disallow /images/
Disallow /js/
Disallow /lib/
Disallow /media/
Disallow /site/
Disallow /contact-us-thanks

mj12bot

Rule Path
Disallow

twitterbot

Rule Path
Disallow

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

trendkite-akashic-crawler

Rule Path
Disallow /
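
To see how the groups above interact, the sketch below rebuilds a robots.txt from the listed rules and evaluates it with Python's standard `urllib.robotparser`. The group order in the real file is not shown in this report, so the reconstruction is an assumption, and `ExampleBot` and the `/shows` path are hypothetical values chosen for illustration. A `Disallow` line with an empty path, as in the mj12bot and twitterbot groups, places no restriction on that crawler.

```python
from urllib.robotparser import RobotFileParser

# Groups as listed in this report: (user agent, disallowed paths).
# An empty path means "nothing is disallowed" for that agent.
GROUPS = [
    ("*", ["/rss/", "/xml/", "/json/", "/headerxml/", "/service/",
           "/util/", "/vp2/", "/embed/", "/html/", "/images/", "/js/",
           "/lib/", "/media/", "/site/", "/contact-us-thanks"]),
    ("mj12bot", [""]),
    ("twitterbot", [""]),
    ("gptbot", ["/"]),
    ("google-extended", ["/"]),
    ("trendkite-akashic-crawler", ["/"]),
]


def build_lines(groups):
    """Render the groups as robots.txt lines, one blank line per group."""
    lines = []
    for agent, paths in groups:
        lines.append(f"User-agent: {agent}")
        lines.extend(f"Disallow: {path}" for path in paths)
        lines.append("")
    return lines


parser = RobotFileParser()
parser.parse(build_lines(GROUPS))

# Hypothetical crawler names and paths, used only to exercise the rules.
checks = [
    ("GPTBot", "https://abc.com/"),
    ("MJ12bot", "https://abc.com/rss/"),
    ("ExampleBot", "https://abc.com/rss/feed"),
    ("ExampleBot", "https://abc.com/shows"),
]
for agent, url in checks:
    verdict = "allowed" if parser.can_fetch(agent, url) else "blocked"
    print(f"{agent:12} {url:28} {verdict}")
```

A crawler with its own group follows only that group, which is why mj12bot may fetch `/rss/` even though the `*` group disallows it. Python's parser applies the first matching rule in a group rather than the longest match described in RFC 9309; for this rule set, which has no `Allow` lines, the two approaches agree.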

Other Records

Field Value
sitemap https://abc.com/sitemapindex-blogs.xml
sitemap https://abc.com/sitemapindex-episodes.xml
sitemap https://abc.com/sitemapindex-showmap.xml
sitemap https://abc.com/sitemapindex-videomap.xml
sitemap https://abc.com/latest-blogs.xml
sitemap https://abc.com/live-channels.xml
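
The sitemap records apply to every crawler regardless of group. As a minimal sketch, they can be read back with `RobotFileParser.site_maps()` (available in Python 3.8 and later); the `Sitemap:` line syntax is the standard robots.txt form and is assumed here, since the report shows only the field and value.

```python
from urllib.robotparser import RobotFileParser

# Sitemap records from this report, written in robots.txt syntax.
SITEMAP_LINES = [
    "Sitemap: https://abc.com/sitemapindex-blogs.xml",
    "Sitemap: https://abc.com/sitemapindex-episodes.xml",
    "Sitemap: https://abc.com/sitemapindex-showmap.xml",
    "Sitemap: https://abc.com/sitemapindex-videomap.xml",
    "Sitemap: https://abc.com/latest-blogs.xml",
    "Sitemap: https://abc.com/live-channels.xml",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)

# site_maps() returns None when no sitemap records were found.
sitemaps = sitemap_parser.site_maps() or []
for url in sitemaps:
    print(url)
```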

Comments

  • Block trendkite-akashic-crawler