cmac.ws
robots.txt

Robots Exclusion Standard data for cmac.ws

Resource Scan

Scan Details

Site Domain cmac.ws
Base Domain cmac.ws
Scan Status Ok
Last Scan2024-10-10T14:51:22+00:00
Next Scan 2024-10-17T14:51:22+00:00

Last Scan

Scanned2024-10-10T14:51:22+00:00
URL https://cmac.ws/robots.txt
Redirect https://www.cmac.ws/robots.txt
Redirect Domain www.cmac.ws
Redirect Base cmac.ws
Domain IPs 104.248.118.35
Redirect IPs 104.248.118.35
Response IP 104.248.118.35
Found Yes
Hash 2d24fadf238ba393a8a2b86f14e0f4ddebee6f7ec33ffbbb341ef4515149d443
SimHash 401ef0b39c53

Groups

ahrefsbot

Rule Path
Disallow /

alphabot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

becomebot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

clickagy intelligence bot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

companybook crawler

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

daum

Rule Path
Disallow /

dnbcrawler-analytics

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

go-http-client

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

lemurwebcrawler

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

maxpointcrawler

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

nimbostratus-bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

repolookoutbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yandexwebmaster

Rule Path
Disallow /

*

Rule Path
Disallow /suggest.json
Disallow /ajax.json
Disallow /search/
Disallow /update/
Disallow /out/
Allow /submit/$
Disallow /submit/
Allow /search-reverse/$
Disallow /search-reverse/

mediapartners-google*

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.cmac.ws/sitemap-index.gz

Warnings

  • 6 invalid lines.