media.singtao.ca
robots.txt

Robots Exclusion Standard data for media.singtao.ca

Resource Scan

Scan Details

Site Domain media.singtao.ca
Base Domain singtao.ca
Scan Status Ok
Last Scan 2024-05-22T03:02:52+00:00
Next Scan 2024-05-29T03:02:52+00:00
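
The two timestamps above are exactly seven days apart, which suggests (but does not confirm) a weekly rescan interval. A minimal sketch of that arithmetic, using only the two values reported here; the variable names are illustrative:

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details above (ISO 8601, UTC offset +00:00).
last_scan = datetime.fromisoformat("2024-05-22T03:02:52+00:00")
next_scan = datetime.fromisoformat("2024-05-29T03:02:52+00:00")

# Difference between the two scans; a weekly schedule is an inference,
# not something the report states.
interval = next_scan - last_scan
print(interval)  # 7 days, 0:00:00
```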

Last Scan

Scanned 2024-05-22T03:02:52+00:00
URL https://media.singtao.ca/robots.txt
Domain IPs 104.26.14.193, 104.26.15.193, 172.67.69.213, 2606:4700:20::681a:ec1, 2606:4700:20::681a:fc1, 2606:4700:20::ac43:45d5
Response IP 172.67.69.213
Found Yes
Hash 71d5cc2abc046d685aea0b1fca66684cde20bc377ff72702ebe40525da23a035
SimHash 7b6ad4d2e413
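
The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a digest of the same shape could be computed as sketched below. The helper name `robots_digest` and the sample body are hypothetical; the sample is not the real file and will not reproduce the reported hash.

```python
import hashlib

def robots_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

# Hash value as reported in the scan above.
reported = "71d5cc2abc046d685aea0b1fca66684cde20bc377ff72702ebe40525da23a035"

# Placeholder body for illustration only; not the actual file contents.
sample = b"User-agent: sosospider\nDisallow: /\n"
digest = robots_digest(sample)

# Both strings are 64 hex digits, the length of a SHA-256 digest.
print(len(digest), len(reported))
```

Comparing a freshly computed digest against the stored one is a cheap way to detect that the file changed between scans, if the assumption about the algorithm holds.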

Groups

googlebot
googlebot-news
mediapartners-google
bingbot
msnbot
slurp
yandex
baiduspider
alexabot
facebot
ia_archiver

Rule Path
Allow /

Other Records

Field Value
crawl-delay 300

sosospider

Rule Path
Disallow /
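
To illustrate how a crawler might interpret the two groups listed so far, the sketch below feeds a reconstructed fragment to Python's standard `urllib.robotparser`. The fragment is an assumption: it is rebuilt from the normalized fields in this report (with only two of the listed user agents, for brevity), not copied from the live file.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report above; the real file's exact text may differ.
ROBOTS_FRAGMENT = """\
User-agent: googlebot
User-agent: bingbot
Allow: /
Crawl-delay: 300

User-agent: sosospider
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())

url = "https://media.singtao.ca/"

# The first group allows everything but asks for a 300-second delay.
print(parser.can_fetch("googlebot", url))   # True
print(parser.crawl_delay("googlebot"))      # 300

# The sosospider group blocks the whole site.
print(parser.can_fetch("sosospider", url))  # False
```

Note that `crawl-delay` is not part of the core Robots Exclusion Protocol, so crawlers differ in whether they honor it; `urllib.robotparser` merely exposes the value.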

amazonbot

Rule Path
Disallow /
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/upgrade/
Disallow /wp-content/wflogs/
Disallow /wp-content/rsspi-log/
Disallow /wp-content/languages/
Disallow /dushi/
Disallow /dushi-single/
Disallow /deals/blackfriday/
Disallow /deals/singclub/
Disallow /readme.html
Disallow /refer/
Disallow /?s=
Disallow /search/
Disallow /daily_

Other Records

Field Value
sitemap https://www.singtao.ca/gen-sitemap/post/today/
sitemap https://www.singtao.ca/gen-sitemap/rtnews/today/
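
The sitemap records point at www.singtao.ca rather than media.singtao.ca. As a small sketch, the same standard-library parser (Python 3.8 or later for `site_maps()`) can list them; the `Sitemap:` lines are reconstructed from the records above and their exact capitalization in the real file is an assumption.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# Rebuilt from the two sitemap records in this report.
SITEMAP_LINES = [
    "Sitemap: https://www.singtao.ca/gen-sitemap/post/today/",
    "Sitemap: https://www.singtao.ca/gen-sitemap/rtnews/today/",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns None when no sitemap records were found.
sitemaps = parser.site_maps() or []
for sitemap_url in sitemaps:
    # Show which host serves each sitemap.
    print(urlsplit(sitemap_url).hostname, sitemap_url)
```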