getui.com
robots.txt

Robots Exclusion Standard data for getui.com

Resource Scan

Scan Details

Site Domain getui.com
Base Domain getui.com
Scan Status Ok
Last Scan2024-09-20T18:09:57+00:00
Next Scan 2024-10-04T18:09:57+00:00

Last Scan

Scanned2024-09-20T18:09:57+00:00
URL https://getui.com/robots.txt
Domain IPs 115.236.20.203
Response IP 115.236.20.203
Found Yes
Hash 07dd1b2c35b98ed4a9f78535bac1e5fd8d8611ce0acb78351e6e3ae458bbcfee
SimHash 02453b0a9952

Groups

*

Rule Path
Allow /
Disallow /**/components/
Disallow www.getui.com/cn/newsfeed/2015/12/1204120.html
Disallow www.getui.com/cn/newsfeed/2014/12/1205103.html

Other Records

Field Value
sitemap https://www.getui.com/sitemap.xml

Warnings

  • 43 invalid lines.