wjfw.com
robots.txt

Robots Exclusion Standard data for wjfw.com

Resource Scan

Scan Details

Site Domain wjfw.com
Base Domain wjfw.com
Scan Status Ok
Last Scan 2024-05-29T05:50:08+00:00
Next Scan 2024-06-05T05:50:08+00:00

Last Scan

Scanned 2024-05-29T05:50:08+00:00
URL https://wjfw.com/robots.txt
Domain IPs 192.104.183.109
Response IP 192.104.183.109
Found Yes
Hash d3607584f34373cfe63cced25d9a275c516c0e8c4e9fa1b5cd396d998d508537
SimHash 3a3ec864e173

Groups

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*

Rule Path
Disallow /tncms/tracking/
Disallow /_services/
Disallow /tncms/block/
Disallow /tncms/track/
Disallow /tncms/tax/
Disallow /tncms/search/
Disallow /tncms/counter/
Disallow /tncms/messaging/
Disallow /tncms/auth/
Disallow /tncms/media/
Disallow /tncms/openweb/*
Disallow /tncms/weather/
Disallow /tncms/openid2/
Disallow /tncms/dmp/
Disallow /tncms/admin/
Disallow /tncms/webservice/
Disallow /tncms/gtm/
Disallow /tncms/user/
Disallow /users/admin/
Disallow /users/*/?
Disallow /tncms/disqus/
Disallow /marketplace/*action%3Dsrch
Disallow /content/tncms/assets/v3/form/
Disallow /content/tncms/form/
Disallow /tncms/calendar/
Disallow /calendar/search/
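
The groups above can be checked programmatically. The following is a minimal sketch, assuming Python's standard urllib.robotparser module and an abbreviated reconstruction of the file (only a few of the listed rules are included). The crawler name "examplebot" and the sample URLs are hypothetical. Note that the standard-library parser does plain prefix matching, so wildcard paths such as /tncms/openweb/* are not interpreted as patterns here.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction of the groups listed in this report.
# Assumption: only a subset of the "*" rules is reproduced here.
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: *
Disallow: /tncms/admin/
Disallow: /users/admin/
Disallow: /calendar/search/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "examplebot" is a hypothetical crawler that matches only the "*" group.
checks = [
    ("gptbot", "https://wjfw.com/news/"),
    ("examplebot", "https://wjfw.com/news/"),
    ("examplebot", "https://wjfw.com/tncms/admin/login"),
]

for agent, url in checks:
    verdict = "allowed" if parser.can_fetch(agent, url) else "blocked"
    print(f"{agent:12} {url:40} {verdict}")
```

Under these assumptions, a named group with Disallow / blocks every path for that agent, while agents without their own group fall back to the "*" rules.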

Other Records

Field Value
sitemap https://www.wjfw.com/sitemap.xml
sitemap https://www.wjfw.com/tncms/sitemap/editorial.xml
sitemap https://www.wjfw.com/tncms/sitemap/editorial.xml?year=2020
sitemap https://www.wjfw.com/tncms/sitemap/editorial.xml?year=2021
sitemap https://www.wjfw.com/tncms/sitemap/editorial.xml?year=2022
sitemap https://www.wjfw.com/tncms/sitemap/editorial.xml?year=2023
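
The sitemap records above can be read with the same standard-library parser. This is a rough sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available) and that the records appear in the file as ordinary "Sitemap:" lines. The helper name sitemap_year is hypothetical.

```python
from urllib.parse import parse_qs, urlparse
from urllib.robotparser import RobotFileParser

# Sitemap records as listed in this report, written as robots.txt lines.
SITEMAP_RECORDS = """\
Sitemap: https://www.wjfw.com/sitemap.xml
Sitemap: https://www.wjfw.com/tncms/sitemap/editorial.xml
Sitemap: https://www.wjfw.com/tncms/sitemap/editorial.xml?year=2020
Sitemap: https://www.wjfw.com/tncms/sitemap/editorial.xml?year=2021
Sitemap: https://www.wjfw.com/tncms/sitemap/editorial.xml?year=2022
Sitemap: https://www.wjfw.com/tncms/sitemap/editorial.xml?year=2023
"""

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_RECORDS.splitlines())

# site_maps() returns None when no Sitemap lines were found.
sitemaps = sitemap_parser.site_maps() or []


def sitemap_year(url: str):
    """Return the 'year' query parameter of a sitemap URL, or None."""
    values = parse_qs(urlparse(url).query).get("year")
    return values[0] if values else None


# Keep only the per-year editorial sitemaps, in file order.
years = [y for y in map(sitemap_year, sitemaps) if y is not None]

for url in sitemaps:
    print(url)
print("Per-year editorial sitemaps:", ", ".join(years))
```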