aggproxy.org
robots.txt

Robots Exclusion Standard data for aggproxy.org

Resource Scan

Scan Details

Site Domain aggproxy.org
Base Domain aggproxy.org
Scan Status Ok
Last Scan2025-12-16T07:24:40+00:00
Next Scan 2026-01-15T07:24:40+00:00

Last Scan

Scanned2025-12-16T07:24:40+00:00
URL https://aggproxy.org/robots.txt
Domain IPs 35.213.184.147
Response IP 35.213.184.147
Found Yes
Hash 9acee18c55c5c4da9208c5a8b9dfd3e762b7dfa29b48359c3234336488a9e691
SimHash ea004c2042af

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-content/
Disallow /wp-includes/
Disallow /xmlrpc.php
Disallow /wp-
Disallow /feed/
Disallow /*/feed
Disallow /trackback/
Disallow /*?*
Disallow /*.zip$
Disallow /*.rar$
Disallow /*.tar.gz$
Allow /wp-content/uploads/

gptbot

Rule Path
Allow /llms.txt
Disallow /

anthropic-ai

Rule Path
Allow /llms.txt
Disallow /

Other Records

Field Value
sitemap https://aggproxy.org/sitemap_index.xml

Comments

  • AI 爬虫访问控制