machine.in.th
robots.txt

Robots Exclusion Standard data for machine.in.th

Resource Scan

Scan Details

Site Domain machine.in.th
Base Domain machine.in.th
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-08-24T08:47:43+00:00
Next Scan 2025-11-22T08:47:43+00:00

Last Successful Scan

Scanned2024-05-02T05:23:10+00:00
URL https://machine.in.th/robots.txt
Redirect https://www.machine.in.th/robots.txt
Redirect Domain www.machine.in.th
Redirect Base machine.in.th
Domain IPs 5.196.246.116
Redirect IPs 5.196.246.116
Response IP 5.196.246.116
Found Yes
Hash 62b186c7acc0681788ed0a35cf77800988808d0be49fb027810e59676a4384a8
SimHash 2c545cd1c513

Groups

*

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

teoma

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

nutch

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow

psbot

Rule Path
Disallow

asterias

Rule Path
Disallow /

yahoo-blogs/v3.9

Rule Path
Disallow
Disallow /css
Disallow /css/
Disallow /fckeditor
Disallow /fckeditor/
Disallow /java
Disallow /java/
Disallow /js
Disallow /js/
Disallow /language
Disallow /language/

Other Records

Field Value
sitemap https://www.machine.in.th/rss.php

Comments

  • Crawl-delay: 5