webcg.net
robots.txt

Robots Exclusion Standard data for webcg.net

Resource Scan

Scan Details

Site Domain webcg.net
Base Domain webcg.net
Scan Status Ok
Last Scan2024-09-17T21:37:19+00:00
Next Scan 2024-09-24T21:37:19+00:00

Last Scan

Scanned2024-09-17T21:37:19+00:00
URL https://webcg.net/robots.txt
Redirect https://www.webcg.net/robots.txt
Redirect Domain www.webcg.net
Redirect Base webcg.net
Domain IPs 202.238.151.62
Redirect IPs 202.238.151.62
Response IP 202.238.151.62
Found Yes
Hash 30083e6b088fcc7a97615dee279e97603ae3ce62ed942a7a03f83e64877cdc88
SimHash 214d8b14fb30

Groups

*

Rule Path
Allow /
Disallow /WEBCG/alliance/
Disallow /WEBCG/campaign/
Disallow /WEBCG/carnavi/
Disallow /WEBCG/carscope/
Disallow /WEBCG/cgtv/
Disallow /WEBCG/counter/
Disallow /WEBCG/essays/
Disallow /WEBCG/forum/
Disallow /WEBCG/FromOurStaff/
Disallow /WEBCG/impressions/
Disallow /WEBCG/magazines/
Disallow /WEBCG/mcg/
Disallow /WEBCG/members/
Disallow /WEBCG/members_top/
Disallow /WEBCG/navi2/
Disallow /WEBCG/naviaudio/
Disallow /WEBCG/news/
Disallow /WEBCG/qa/
Disallow /WEBCG/selection/
Disallow /WEBCG/showcase/
Disallow /WEBCG/specials/
Disallow /WEBCG/special_offer/
Disallow /WEBCG/test/
Disallow /resources/webcg/js/v5/main/
Disallow /resources/webcg/js/v1/smartphone/main/
Disallow /list/personal/premium/common-header-status-display
Disallow /auth/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.webcg.net/sitemap.xml