agreenidea.org
robots.txt

Robots Exclusion Standard data for agreenidea.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	agreenidea.org
Base Domain	agreenidea.org
Scan Status	Ok
Last Scan	2024-06-01T06:47:06+00:00
Next Scan	2024-07-01T06:47:06+00:00

Last Scan

Scanned	2024-06-01T06:47:06+00:00
URL	http://agreenidea.org/robots.txt
Redirect	https://what.agreenidea.org/robots.txt
Redirect Domain	what.agreenidea.org
Redirect Base	agreenidea.org
Domain IPs	202.124.241.178
Redirect IPs	74.114.154.18, 74.114.154.22
Response IP	74.114.154.18
Found	Yes
Hash	a8cee2bb5253c3a78aed1976468bae9381e4866e5dd15941995ba4546ddb0b86
SimHash	639cca478486

Groups

*

Rule	Path
Disallow	/random
Disallow	/day
Disallow	/sticky-ad-iframe.html
Disallow	/privacy/consent

Rule

Path

Disallow

/random

Disallow

/day

Disallow

/sticky-ad-iframe.html

Disallow

/privacy/consent

Other Records

Field	Value
crawl-delay	1

Field

Value

crawl-delay

1

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

sentibot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

facebookbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

omgili

Rule	Path
Disallow	/

Rule

Path

Disallow

/

omgilibot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

/

imagesiftbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://what.agreenidea.org/sitemap.xml

Field

Value

sitemap

https://what.agreenidea.org/sitemap.xml

Back to top

Comments

Common Crawl's crawler
SentiBot's crawler
Google Bard's crawler
Facebook's crawler
webz.io's crawler
webz.io's crawler
Amazon's crawler
ClaudeBot's crawler
anthropic-ai's crawler
ImageSift's AI crawler

Back to top

agreenidea.orgrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

ccbot

sentibot

google-extended

facebookbot

omgili

omgilibot

amazonbot

claudebot

anthropic-ai

imagesiftbot

Other Records

Comments

agreenidea.org
robots.txt