today.com
robots.txt

Robots Exclusion Standard data for today.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	today.com
Base Domain	today.com
Scan Status	Ok
Last Scan	2024-09-21T14:51:24+00:00
Next Scan	2024-09-28T14:51:24+00:00

Last Scan

Scanned	2024-09-21T14:51:24+00:00
URL	https://today.com/robots.txt
Redirect	https://www.today.com/robots.txt
Redirect Domain	www.today.com
Redirect Base	today.com
Domain IPs	34.206.62.195, 44.219.25.60, 52.37.236.218, 54.69.127.167
Redirect IPs	23.203.77.42
Response IP	184.87.132.131
Found	Yes
Hash	e0dd878d298a9f94cc0f6d9455260832ef36f159e9df7718c5264eb447c72f13
SimHash	701d1954a0e0

Groups

*

Rule	Path
Disallow	/search*
Disallow	/xml/today/SitemapToday*.xml
Disallow	/ajax*
Disallow	/bentoapi/

Rule

Path

Disallow

/search*

Disallow

/xml/today/SitemapToday*.xml

Disallow

/ajax*

Disallow

/bentoapi/

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

awariorssbot
awariosmartbot

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

claude-web

Rule	Path
Disallow	/

Rule

Path

Disallow

cohere-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

dataforseobot

Rule	Path
Disallow	/

Rule

Path

Disallow

diffbot

Rule	Path
Disallow	/

Rule

Path

Disallow

facebookbot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

magpie-crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

meta-externalagent
meta-externalagent

Rule	Path
Disallow	/

Rule

Path

Disallow

newsnow

Rule	Path
Disallow	/

Rule

Path

Disallow

news-please

Rule	Path
Disallow	/

Rule

Path

Disallow

oai-searchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

omgili

Rule	Path
Disallow	/

Rule

Path

Disallow

omgilibot

Rule	Path
Disallow	/

Rule

Path

Disallow

peer39_crawler
peer39_crawler/1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

perplexitybot

Rule	Path
Disallow	/

Rule

Path

Disallow

scrapy

Rule	Path
Disallow	/

Rule

Path

Disallow

turnitinbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://www.today.com/sitemap/today/sitemap-index
sitemap	https://www.today.com/sitemap/today/sitemap-news
sitemap	https://www.today.com/sitemap/today/sitemap-curations
sitemap	https://www.today.com/sitemap/today/sitemap-shop.xml

Field

Value

sitemap

https://www.today.com/sitemap/today/sitemap-index

sitemap

https://www.today.com/sitemap/today/sitemap-news

sitemap

https://www.today.com/sitemap/today/sitemap-curations

sitemap

https://www.today.com/sitemap/today/sitemap-shop.xml

Comments

Disallow Bots
Sitemaps

today.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

amazonbot

anthropic-ai

applebot-extended

awariorssbotawariosmartbot

bytespider

ccbot

chatgpt-user

claudebot

claude-web

cohere-ai

dataforseobot

diffbot

facebookbot

google-extended

gptbot

magpie-crawler

meta-externalagentmeta-externalagent

newsnow

news-please

oai-searchbot

omgili

omgilibot

peer39_crawlerpeer39_crawler/1.0

perplexitybot

scrapy

turnitinbot

Other Records

Comments

today.com
robots.txt

awariorssbot
awariosmartbot

meta-externalagent
meta-externalagent

peer39_crawler
peer39_crawler/1.0