wdfxfox34.com
robots.txt

Robots Exclusion Standard data for wdfxfox34.com

Resource Scan

Scan Details

Site Domain wdfxfox34.com
Base Domain wdfxfox34.com
Scan Status Ok
Last Scan2024-05-11T05:57:57+00:00
Next Scan 2024-05-18T05:57:57+00:00

Last Scan

Scanned2024-05-11T05:57:57+00:00
URL https://wdfxfox34.com/robots.txt
Domain IPs 192.104.182.109
Response IP 192.104.182.109
Found Yes
Hash a26e138ec9e88e34f4a244b462115a43d1f23ed9b1cb0c36ab63c305d55343fa
SimHash aa3e4a54a573

Groups

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*

Rule Path
Disallow /tncms/tracking/
Disallow /_services/
Disallow /tncms/block/
Disallow /tncms/tax/
Disallow /tncms/weather/
Disallow /tncms/admin/
Disallow /tncms/gtm/
Disallow /tncms/track/
Disallow /tncms/search/
Disallow /tncms/openweb/*
Disallow /tncms/openid2/
Disallow /tncms/messaging/
Disallow /tncms/media/
Disallow /tncms/webservice/
Disallow /tncms/counter/
Disallow /tncms/dmp/
Disallow /tncms/auth/
Disallow /marketplace/*action%3Dsrch
Disallow /content/tncms/assets/v3/form/
Disallow /content/tncms/form/
Disallow /tncms/user/
Disallow /users/admin/
Disallow /users/*/?
Disallow /tncms/disqus/
Disallow /newsletter/optimize
Disallow /newsletter/optimize/advertisement
Disallow /newsletter/optimize/breaking
Disallow /newsletter/optimize/daily_headlines
Disallow /newsletter/optimize/weather
Disallow /newsletter/optimize/weekly_best_of
Disallow /tncms/calendar/

Other Records

Field Value
sitemap https://www.wdfxfox34.com/sitemap.xml
sitemap https://www.wdfxfox34.com/tncms/sitemap/editorial.xml