diverse-ways.com
robots.txt

Robots Exclusion Standard data for diverse-ways.com

Resource Scan

Scan Details

Site Domain diverse-ways.com
Base Domain diverse-ways.com
Scan Status Ok
Last Scan2025-12-23T07:13:25+00:00
Next Scan 2025-12-30T07:13:25+00:00

Last Scan

Scanned2025-12-23T07:13:25+00:00
URL https://diverse-ways.com/robots.txt
Domain IPs 163.43.80.48
Response IP 163.43.80.48
Found Yes
Hash 50b607fb8c3e4e219eeda136961a739e76a40ee42a6aa8d6e8c09a3908a8cee7
SimHash 3934c8413474

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Disallow /wp-content/uploads/wpo/wpo-plugins-tables-list.json

Other Records

Field Value
sitemap https://diverse-ways.com/sitemap.xml

Comments

  • XML Sitemap & Google News version 5.3.6 - https://status301.net/wordpress-plugins/xml-sitemap-feed/
  • ChatGPT関連のクローラーをブロック
  • ClaudeBot関連のクローラーをブロック
  • Google-Extendedを含むGoogleのクローラーをブロック
  • Common Crawlのクローラーをブロック