cswarzone.com
robots.txt

Robots Exclusion Standard data for cswarzone.com

Resource Scan

Scan Details

Site Domain cswarzone.com
Base Domain cswarzone.com
Scan Status Ok
Last Scan 2024-11-14T21:07:04+00:00
Next Scan 2024-11-21T21:07:04+00:00

Last Scan

Scanned 2024-11-14T21:07:04+00:00
URL https://cswarzone.com/robots.txt
Domain IPs 65.108.104.232
Response IP 65.108.104.232
Found Yes
Hash b008ecb85a9fc930d98170700e3231b078cbe405a062c8c71b3cc607b4b636f6
SimHash 62e4df40e431
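
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not say which algorithm or which bytes were hashed, so the following is only a sketch under the assumption that it is a SHA-256 digest of the raw response body. The helper names `sha256_hex` and `matches_report` are hypothetical.

```python
import hashlib

# Hash value copied from the "Last Scan" table in this report.
REPORTED_HASH = "b008ecb85a9fc930d98170700e3231b078cbe405a062c8c71b3cc607b4b636f6"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched robots.txt body against the reported hash.

    Assumption: the scanner hashed the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == reported.lower()
```

If a freshly fetched copy of the file does not match, either the file has changed since the scan or the assumption about the hashing method does not hold.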

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /*/*.css
Allow /*/*.js
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /readme.html
Disallow /license.txt
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-register.php
Disallow */disclaimer/*
Disallow *?attachment_id=
Disallow /privacy-policy
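
The rules in this group mix Allow and Disallow patterns with `*` wildcards, so the result for a given path depends on how a crawler resolves conflicts. The sketch below is a minimal illustration of the longest-match rule described in RFC 9309, with pattern length standing in for specificity and Allow winning ties. It is not the scanner's or any particular crawler's implementation. The function names are hypothetical, and patterns that start with `*` instead of `/` are matched as written, which is an assumption.

```python
import re

# Rules copied from the first "*" group in this report.
RULES = [
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/*/*.css"),
    ("allow", "/*/*.js"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/readme.html"),
    ("disallow", "/license.txt"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-register.php"),
    ("disallow", "*/disclaimer/*"),
    ("disallow", "*?attachment_id="),
    ("disallow", "/privacy-policy"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule allows the path.

    Paths that match no rule are allowed. On equal length, Allow wins.
    """
    best_len = -1
    allowed = True
    for verdict, pattern in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and verdict == "allow"):
                best_len = length
                allowed = verdict == "allow"
    return allowed
```

Under this sketch, `is_allowed("/wp-admin/admin-ajax.php")` is true because the Allow pattern is longer than `Disallow /wp-admin/`, while `is_allowed("/wp-includes/js/jquery.js")` is false because `Disallow /wp-includes/` is longer than `Allow /*/*.js`.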

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

bingbot

Rule Path
Allow /

msnbot

Rule Path
Allow /

msnbot-media

Rule Path
Allow /wp-content/uploads/

applebot

Rule Path
Allow /

yandex

Rule Path
Allow /

yandeximages

Rule Path
Allow /wp-content/uploads/

slurp

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

qwantify

Rule Path
Allow /

baiduspider

Rule Path
Allow /

baiduspider/2.0

Rule Path
Allow /

baiduspider-video

Rule Path
Allow /

baiduspider-image

Rule Path
Allow /

sogou spider

Rule Path
Allow /

sogou web spider

Rule Path
Allow /

sosospider

Rule Path
Allow /

sosospider+

Rule Path
Allow /

sosospider/2.0

Rule Path
Allow /

yodao

Rule Path
Allow /

youdao

Rule Path
Allow /

youdaobot

Rule Path
Allow /

youdaobot/1.0

Rule Path
Allow /

ahrefsbot

Rule Path
Disallow /

alexibot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

xenu's

Rule Path
Disallow /

xenu's link sleuth 1.1c

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot/1.1

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

*

Rule Path
Allow /*.png*
Allow /*.jpg*
Allow /*.gif*
Allow /*.webp*
Disallow /search/
Disallow *?s=*
Disallow *?p=*
Disallow *%26p%3D*
Disallow *%26preview%3D*
Disallow /search

facebookexternalhit/1.0

Rule Path
Allow /

facebookexternalhit/1.1

Rule Path
Allow /

facebookplatform/1.0

Rule Path
Allow /

facebot/1.0

Rule Path
Allow /

visionutils/0.2

Rule Path
Allow /

datagnionbot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

linkedinbot/1.0

Rule Path
Allow /

pinterest/0.1

Rule Path
Allow /

pinterest/0.2

Rule Path
Allow /

*

Rule Path
Allow /ads.txt

*

Rule Path
Allow /app-ads.txt

Other Records

Field Value
sitemap https://www.cswarzone.com/sitemap_index.xml

Comments

  • Popular chinese search engines
  • Backlink Protector. Powered by Better Robots.txt Pro
  • Image Crawlability by search engines
  • Avoid crawler traps causing crawl budget issues
  • Social Media Crawling
  • Allow/Disallow Ads.txt
  • Allow/Disallow App-ads.txt
  • WARZONE