agentm.tw
robots.txt

Robots Exclusion Standard data for agentm.tw

Resource Scan

Scan Details

Site Domain agentm.tw
Base Domain agentm.tw
Scan Status Ok
Last Scan2024-11-14T07:21:07+00:00
Next Scan 2024-11-21T07:21:07+00:00

Last Scan

Scanned2024-11-14T07:21:07+00:00
URL https://agentm.tw/robots.txt
Domain IPs 104.26.4.161, 104.26.5.161, 172.67.74.107, 2606:4700:20::681a:4a1, 2606:4700:20::681a:5a1, 2606:4700:20::ac43:4a6b
Response IP 172.67.74.107
Found Yes
Hash e4cb18fdbd6cd070466e00418324ba8efd87f6a38bda113dc024ab8ff8e14471
SimHash 0a5fdc60aeb2

Groups

semrushbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

buck

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

nimbostratus-bot

Rule Path
Disallow /

*

Rule Path
Disallow /v1/search_page*

Other Records

Field Value
sitemap https://www.agentm.tw/site_map.xml

Warnings

  • 2 invalid lines.