1r1g.cn
robots.txt
Robots Exclusion Standard data for 1r1g.cn
Resource Scan
Scan Details
Site Domain | 1r1g.cn |
Base Domain | 1r1g.cn |
Scan Status | Ok |
Last Scan | 2024-07-01T23:47:16+00:00 |
Next Scan | 2024-07-08T23:47:16+00:00 |
Last Scan
Scanned | 2024-07-01T23:47:16+00:00 |
URL | https://1r1g.cn/robots.txt |
Domain IPs | 119.91.27.230 |
Response IP | 119.91.27.230 |
Found | Yes |
Hash | 423344e966857e666a9cf232c27d6ef33c91ecc43e4e8e37fd334b96cc2cafa2 |
SimHash | 3a981cd1b0e2 |
Groups
proximic
bizinformation
blexbot
riddler
ltx71
magpie-crawler
grapeshot
grapeshotcrawler
gigablastopensource
bubing
linkdexbot
linkdexbot/2.2
seokicks
seokicks-robot
panscient.com
webdatastats
zoominfobot
Rule | Path |
---|---|
Disallow | / |
yandex
yandexbot
yandexmobilebot
yandeximageresizer
coccocbot
coccocbot-web
coccocbot-image
yeti
seekbot
seekport
seekport crawler
Rule | Path |
---|---|
Disallow | / |
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
*
Rule | Path |
---|---|
Disallow | /upload/ |
*
Rule | Path |
---|---|
Disallow | /media/ |
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Other Records
Field | Value |
---|---|
sitemap | https://qa.1r1g.cn/sitemap.xml |
sitemap | https://qa.1r1g.cn/superuser/sitemap.xml |
sitemap | https://qa.1r1g.cn/askubuntu/sitemap.xml |
sitemap | https://qa.1r1g.cn/serverfault/sitemap.xml |
sitemap | https://qa.1r1g.cn/unix/sitemap.xml |
Warnings
- 2 invalid lines.
Comments