amwatch.com
robots.txt

Robots Exclusion Standard data for amwatch.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	amwatch.com
Base Domain	amwatch.com
Scan Status	Ok
Last Scan	2024-11-04T06:51:37+00:00
Next Scan	2024-11-18T06:51:37+00:00

Last Scan

Scanned	2024-11-04T06:51:37+00:00
URL	https://amwatch.com/robots.txt
Domain IPs	18.155.68.107, 18.155.68.34, 18.155.68.62, 18.155.68.96
Response IP	18.155.68.62
Found	Yes
Hash	3bfa56d1f5336c3a5428ac69fd7bd37d004b0ddc10f0da36e0ea8653d763321f
SimHash	381f9824a7e4

Groups

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

/

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

/

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Disallow	/archive/
Disallow	/auth
Disallow	/user/addTrial
Disallow	/metrics
Disallow	/health
Disallow	/cache
Disallow	/esi
Disallow	/mark-variant-won
Disallow	/article/5253094
Disallow	/Sygdom___Sundhed/article5253094.ece
Disallow	/service/cbp

Rule

Path

Disallow

/archive/

Disallow

/auth

Disallow

/user/addTrial

Disallow

/metrics

Disallow

/health

Disallow

/cache

Disallow

/esi

Disallow

/mark-variant-won

Disallow

/article/5253094

Disallow

/Sygdom___Sundhed/article5253094.ece

Disallow

/service/cbp

Back to top

Other Records

Field	Value
sitemap	https://amwatch.com/sitemapindex.xml

Field

Value

sitemap

https://amwatch.com/sitemapindex.xml

Back to top

Comments

AI crawler reference
The link below provides instructions to what kind of content can be used to train AI models on this website
https://amwatch.com/ai.txt
Common crawl
OpenAI (ChatGPT)
OpenAI (ChatGPT realtime search)
Anthropic
Google (only AI crawler)

Back to top

amwatch.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

ccbot

gptbot

chatgpt-user

anthropic-ai

google-extended

*

Other Records

Comments

amwatch.com
robots.txt