wfc2021.org
robots.txt

Robots Exclusion Standard data for wfc2021.org

Resource Scan

Scan Details

Site Domain wfc2021.org
Base Domain wfc2021.org
Scan Status Ok
Last Scan2025-11-08T01:06:48+00:00
Next Scan 2025-12-08T01:06:48+00:00

Last Scan

Scanned2025-11-08T01:06:48+00:00
URL https://wfc2021.org/robots.txt
Domain IPs 104.21.41.80, 172.67.163.12, 2606:4700:3031::6815:2950, 2606:4700:3037::ac43:a30c
Response IP 104.21.41.80
Found Yes
Hash ed02c31f673de333a8fdb9f726ea6891272154cb662354a40dd9ea9564511d1c
SimHash 0e24d870e633

Groups

*

Rule Path
Disallow /search
Disallow /admin
Disallow /search?*
Disallow /search?search=
Disallow /*.pdf$
Disallow /?
Disallow /*?
Disallow /*?page=
Disallow /cgi-bin*
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://wfc2021.org/sitemap.xml