chengyucidian.18dao.net
robots.txt

Robots Exclusion Standard data for chengyucidian.18dao.net

Resource Scan

Scan Details

Site Domain chengyucidian.18dao.net
Base Domain 18dao.net
Scan Status Ok
Last Scan 2025-10-19T08:24:25+00:00
Next Scan 2025-11-18T08:24:25+00:00

Last Scan

Scanned 2025-10-19T08:24:25+00:00
URL https://chengyucidian.18dao.net/robots.txt
Domain IPs 104.26.0.126, 104.26.1.126, 172.67.74.114, 2606:4700:20::681a:17e, 2606:4700:20::681a:7e, 2606:4700:20::ac43:4a72
Response IP 104.26.1.126
Found Yes
Hash b49300120151027749a3ab5cf47a3315d13d041e8bce101fce5bab174212e0a8
SimHash 599c5f00c651
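
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The sketch below assumes SHA-256 over the raw response bytes to show how such a fingerprint could be used to detect changes between scans. The function name content_fingerprint and the sample body are hypothetical, and the SimHash value is not reproduced here because its algorithm is not stated.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256 is used; the scan report does not say which
    hash function produced its 64-character value.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file from the scan.
sample = b"User-agent: *\nDisallow: /admin/\n"
digest = content_fingerprint(sample)

# Any change to the bytes, even whitespace, yields a different digest.
changed = content_fingerprint(sample + b"\n")
print(digest != changed)
```

Comparing the digest from one scan with the next is a cheap way to tell whether the file changed at all before re-parsing it.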

Groups

mediapartners-google

Rule Path
Disallow
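
A Disallow rule with an empty path, as in the mediapartners-google group above, blocks nothing, so that crawler may fetch every URL. The sketch below checks this with Python's standard urllib.robotparser on a reduced, hand-written file. The two-group file and the agent name SomeOtherBot are assumptions for illustration, not the full file from the scan.

```python
from urllib.robotparser import RobotFileParser

# Reduced robots.txt: one group with an empty Disallow,
# and a catch-all group with a single rule taken from the listing.
lines = [
    "User-agent: mediapartners-google",
    "Disallow:",
    "",
    "User-agent: *",
    "Disallow: /admin/",
]

parser = RobotFileParser()
parser.parse(lines)

# The named group has an empty Disallow, so nothing is blocked for it.
adsense_ok = parser.can_fetch("Mediapartners-Google", "/admin/")

# Any other agent falls through to the "*" group and is blocked.
other_ok = parser.can_fetch("SomeOtherBot", "/admin/")

print(adsense_ok, other_ok)
```

Because a crawler follows only the most specific group that names it, mediapartners-google ignores every rule in the * group.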

*

Rule Path
Allow /sites/default/files/
Disallow /1027280/
Disallow /m/1027280/
Disallow /*comment/reply/*
Disallow /*image_captcha*
Disallow /cdn-cgi/
Disallow /includes/
Disallow /misc/
Disallow /modules/
Disallow /profiles/
Disallow /scripts/
Disallow /themes/
Disallow /CHANGELOG.txt
Disallow /cron.php
Disallow /INSTALL.mysql.txt
Disallow /INSTALL.pgsql.txt
Disallow /INSTALL.sqlite.txt
Disallow /install.php
Disallow /INSTALL.txt
Disallow /LICENSE.txt
Disallow /MAINTAINERS.txt
Disallow /update.php
Disallow /UPGRADE.txt
Disallow /xmlrpc.php
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips/
Disallow /node/add/
Disallow /search/
Disallow /user/register/
Disallow /user/password/
Disallow /user/login/
Disallow /user/logout/
Disallow /?q=admin%2F
Disallow /?q=comment%2Freply%2F
Disallow /?q=filter%2Ftips%2F
Disallow /?q=node%2Fadd%2F
Disallow /?q=search%2F
Disallow /?q=user%2Fpassword%2F
Disallow /?q=user%2Fregister%2F
Disallow /?q=user%2Flogin%2F
Disallow /?q=user%2Flogout%2F
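
Several rules in the * group use the * wildcard, which urllib.robotparser does not expand. The sketch below is a minimal matcher that follows the longest-match convention described in RFC 9309: the longest matching pattern wins, and Allow wins a tie. It covers only a subset of the rules listed above, and the names RULES, pattern_to_regex and is_allowed are hypothetical. Percent-encoding is compared literally here, which a production parser would normalize first.

```python
import re

# Subset of the "*" group from the listing, as (directive, path) pairs.
RULES = [
    ("allow", "/sites/default/files/"),
    ("disallow", "/*comment/reply/*"),
    ("disallow", "/admin/"),
    ("disallow", "/search/"),
    ("disallow", "/?q=admin%2F"),
]


def pattern_to_regex(path):
    """Turn a robots.txt path into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a prefix.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best_len, allowed = -1, True
    for directive, path in rules:
        if pattern_to_regex(path).match(url_path):
            is_allow = directive == "allow"
            if len(path) > best_len or (len(path) == best_len and is_allow):
                best_len, allowed = len(path), is_allow
    return allowed


print(is_allowed("/admin/settings"))                # blocked by /admin/
print(is_allowed("/node/5/comment/reply/3"))        # blocked by the wildcard rule
print(is_allowed("/sites/default/files/logo.png"))  # explicitly allowed
```

The paired rules in the listing (for example /admin/ and /?q=admin%2F) cover the same Drupal paths with and without clean URLs, so both forms need to be checked.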

Other Records

Field Value
sitemap https://chengyucidian.18dao.net/sitemap.xml
sitemap https://chengyucidian.18dao.net/rss.xml
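
Sitemap records are independent of any user-agent group, so a parser collects them separately from the rules. A minimal sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available) and a hand-written two-line input rather than the live file:

```python
from urllib.robotparser import RobotFileParser

# The two sitemap records from the listing, written as robots.txt lines.
lines = [
    "Sitemap: https://chengyucidian.18dao.net/sitemap.xml",
    "Sitemap: https://chengyucidian.18dao.net/rss.xml",
]

parser = RobotFileParser()
parser.parse(lines)

# site_maps() returns the URLs in file order, or None if there are none.
sitemaps = parser.site_maps()
for url in sitemaps:
    print(url)
```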

Comments

  • robots.php
  • site: chengyucidian.18dao.net
  • jamesqi
  • 2013-9-11
  • modify start
  • 2023-4-8
  • sitemap start
  • sitemap end
  • modify end
  • 2023-7-12 # Crawl-delay: 10
  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)