aws.org
robots.txt

Robots Exclusion Standard data for aws.org

Resource Scan

Scan Details

Site Domain aws.org
Base Domain aws.org
Scan Status Ok
Last Scan2025-10-18T12:42:35+00:00
Next Scan 2025-11-17T12:42:35+00:00

Last Scan

Scanned2025-10-18T12:42:35+00:00
URL https://aws.org/robots.txt
Redirect https://www.aws.org/robots.txt
Redirect Domain www.aws.org
Redirect Base aws.org
Domain IPs 104.26.2.70, 104.26.3.70, 172.67.69.19, 2606:4700:20::681a:246, 2606:4700:20::681a:346, 2606:4700:20::ac43:4513
Redirect IPs 20.119.128.10
Response IP 20.119.128.10
Found Yes
Hash 9037556855977a1fade87cf53341afa5420d613661394f2ced054505b1977e81
SimHash 4d15f951c691

Groups

*

Rule Path
Allow /
Allow /shop/
Allow /*.js$
Allow /about/
Allow /*.css$
Disallow /dev/
Disallow /tag/
Disallow /cart/
Allow /contact/
Disallow /admin/
Disallow /login/
Disallow /print/
Disallow /files/
Allow /business/
Disallow /search/
Disallow /*?sort=
Allow /education/
Allow /corporate/
Allow /educators/
Disallow /cgi-bin/
Disallow /wp-json/
Disallow /scripts/
Disallow /staging/
Disallow /members/
Allow /membership/
Disallow /category/
Disallow /*?filter=
Allow /conferences/
Disallow /thank-you/
Allow /publications/
Allow /certification/
Disallow /old-content/
Disallow /confirmation/
Allow /career-resources/
Disallow /search-results/
Allow /magazines-and-media/
Allow /community-and-events/
Allow /standards-and-publications/

exabot

Rule Path
Allow /

youbot

Rule Path
Allow /

andibot

Rule Path
Allow /

phindbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

oai-searchbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

firecrawlagent

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.aws.org/sitemap.xml