clck.adskeeper.com
robots.txt

Robots Exclusion Standard data for clck.adskeeper.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	clck.adskeeper.com
Base Domain	adskeeper.com
Scan Status	Ok
Last Scan	2026-02-19T23:02:28+00:00
Next Scan	2026-03-21T23:02:28+00:00

Last Scan

Scanned	2026-02-19T23:02:28+00:00
URL	https://clck.adskeeper.com/robots.txt
Domain IPs	104.18.35.150, 172.64.152.106, 2606:4700:4403::6812:2396, 2606:4700:4404::ac40:986a
Response IP	104.18.35.150
Found	Yes
Hash	0d50233f18dda636b429df382ec3a6a54630acaae6eda7495029ff133433f03d
SimHash	71140b61c211

Groups

*

Rule	Path
Disallow	/search
Disallow	/redirect
Disallow	/news
Disallow	/rnews
Disallow	/tnews
Disallow	/ghits
Disallow	/pnews
Disallow	/graphql
Disallow	/webinars
Disallow	/translator
Disallow	/dpd
Disallow	/landing/cpc/push-for-advertisers/

Rule

Path

Disallow

/search

Disallow

/redirect

Disallow

/news

Disallow

/rnews

Disallow

/tnews

Disallow

/ghits

Disallow

/pnews

Disallow

/graphql

Disallow

/webinars

Disallow

/translator

Disallow

/dpd

Disallow

/landing/cpc/push-for-advertisers/

google-extended

Rule	Path
Allow	/

Rule

Path

Allow

gptbot

Rule	Path
Allow	/

Rule

Path

Allow

chatgpt-user

Rule	Path
Allow	/

Rule

Path

Allow

ccbot

Rule	Path
Allow	/

Rule

Path

Allow

perplexitybot

Rule	Path
Allow	/

Rule

Path

Allow

anthropic-ai

Rule	Path
Allow	/

Rule

Path

Allow

claude-web

Rule	Path
Allow	/

Rule

Path

Allow

claudebot

Rule	Path
Allow	/

Rule

Path

Allow

amazonbot

Rule	Path
Allow	/

Rule

Path

Allow

omgilibot

Rule	Path
Allow	/

Rule

Path

Allow

facebookbot

Rule	Path
Allow	/

Rule

Path

Allow

applebot

Rule	Path
Allow	/

Rule

Path

Allow

bytespider

Rule	Path
Allow	/

Rule

Path

Allow

diffbot

Rule	Path
Allow	/

Rule

Path

Allow

imagesiftbot

Rule	Path
Allow	/

Rule

Path

Allow

omgili

Rule	Path
Allow	/

Rule

Path

Allow

youbot

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
sitemap	https://clck.adskeeper.com/sitemap.xml

Field

Value

sitemap

https://clck.adskeeper.com/sitemap.xml

Warnings

`host` is not a known field.

clck.adskeeper.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

google-extended

gptbot

chatgpt-user

ccbot

perplexitybot

anthropic-ai

claude-web

claudebot

amazonbot

omgilibot

facebookbot

applebot

bytespider

diffbot

imagesiftbot

omgili

youbot

Other Records

Warnings

clck.adskeeper.com
robots.txt