2513142.xyz
robots.txt

Robots Exclusion Standard data for 2513142.xyz

Resource Scan

Scan Details

Site Domain 2513142.xyz
Base Domain 2513142.xyz
Scan Status Ok
Last Scan2025-04-03T01:38:17+00:00
Next Scan 2025-05-03T01:38:17+00:00

Last Scan

Scanned2025-04-03T01:38:17+00:00
URL https://2513142.xyz/robots.txt
Domain IPs 104.21.94.114, 172.67.222.191, 2606:4700:3034::ac43:debf, 2606:4700:3037::6815:5e72
Response IP 172.67.222.191
Found Yes
Hash eedf30de4a5be00db97dbc56b47181b1deaf817509d540a4d78280927647ff95
SimHash 29155e74c4aa

Groups

*

Rule Path
Disallow /admin/
Disallow /cgi-bin/
Disallow /tmp/
Disallow /private/
Disallow /backend/
Disallow /includes/
Disallow /install/
Disallow /misc/
Disallow /modules/
Disallow /profiles/
Disallow /scripts/
Disallow /themes/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-json/
Disallow /cron.php
Disallow /update.php
Disallow /install.php
Disallow /xmlrpc.php
Disallow /search/
Disallow /api/
Disallow /ajax/
Disallow /cdn-cgi/
Disallow /*?
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.docx$
Disallow /*.xls$
Disallow /*.xlsx$
Disallow /*.ppt$
Disallow /*.pptx$
Disallow /*.txt$
Disallow /*?*
Disallow /*%26*
Disallow /*.json$
Disallow /*.xml$
Disallow /*?s=*
Disallow /*?p=*
Disallow /tag/
Disallow /page/*
Disallow /trackback/
Disallow /comments/
Disallow /comment-page-/
Allow /images/
Allow /css/
Allow /js/
Allow /sitemap.xml
Allow /robots.txt
Allow /assets/
Allow /static/
Allow /public/img/
Allow /wp-content/uploads/

googlebot

Rule Path
Allow /articles/
Allow /blog/
Allow /products/
Allow /categories/
Allow /tags/
Allow /sitemap_index.xml
Allow /news/
Allow /resources/

Other Records

Field Value
crawl-delay 10

bingbot

Rule Path
Allow /articles/
Allow /blog/
Allow /products/
Allow /categories/
Allow /sitemap_index.xml

Other Records

Field Value
crawl-delay 10

baiduspider

Rule Path
Allow /articles/
Allow /blog/
Allow /products/
Allow /categories/
Allow /sitemap_index.xml

Other Records

Field Value
crawl-delay 10

yandexbot

Rule Path
Disallow /

duckduckbot

Rule Path
Allow /articles/
Allow /blog/
Allow /products/

Other Records

Field Value
crawl-delay 15

googlebot-image

Rule Path
Disallow /

bingbot-image

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

adsbot-google

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

twitterbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

Comments

  • å
  • å
  • 专门针对Google的规则
  • 专门针对Bing的规则
  • 专门针对百度的规则
  • 针对å
  • 禁止图片搜索引擎
  • 禁止广告和社交媒体爬虫
  • 禁止存档服务
  • 网站地图
  • Sitemap: https://www.example.com/sitemap.xml
  • Sitemap: https://www.example.com/sitemap_news.xml
  • Sitemap: https://www.example.com/sitemap_products.xml
  • Sitemap: https://www.example.com/sitemap_categories.xml

Warnings

  • 3 invalid lines.