what.agreenidea.org
robots.txt

Robots Exclusion Standard data for what.agreenidea.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	what.agreenidea.org
Base Domain	agreenidea.org
Scan Status	Ok
Last Scan	2024-06-26T13:05:19+00:00
Next Scan	2024-07-10T13:05:19+00:00

Last Scan

Scanned	2024-06-26T13:05:19+00:00
URL	https://what.agreenidea.org/robots.txt
Domain IPs	74.114.154.18, 74.114.154.22
Response IP	74.114.154.22
Found	Yes
Hash	d061b0c5602a245d7fe10b50f2acbd0ec2fff551839a99ec7da62e1e67af7d01
SimHash	6b9cca438486

Groups

*

Rule	Path
Disallow	/random
Disallow	/day
Disallow	/sticky-ad-iframe.html
Disallow	/privacy/consent

Rule

Path

Disallow

/random

Disallow

/day

Disallow

/sticky-ad-iframe.html

Disallow

/privacy/consent

Other Records

Field	Value
crawl-delay	1

Field

Value

crawl-delay

1

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

sentibot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

facebookbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

omgili

Rule	Path
Disallow	/

Rule

Path

Disallow

/

omgilibot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

/

imagesiftbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://what.agreenidea.org/sitemap.xml

Field

Value

sitemap

https://what.agreenidea.org/sitemap.xml

Back to top

Comments

Common Crawl's crawler
SentiBot's crawler
Google Bard's crawler
Facebook's crawler
webz.io's crawler
webz.io's crawler
Amazon's crawler
ClaudeBot's crawler
anthropic-ai's crawler
ImageSift's AI crawler
Apple's AI crawler

Back to top

what.agreenidea.orgrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

ccbot

sentibot

google-extended

facebookbot

omgili

omgilibot

amazonbot

claudebot

anthropic-ai

imagesiftbot

applebot-extended

Other Records

Comments

what.agreenidea.org
robots.txt