canligaste.com
robots.txt

Robots Exclusion Standard data for canligaste.com

Resource Scan

Scan Details

Site Domain canligaste.com
Base Domain canligaste.com
Scan Status Ok
Last Scan2025-09-28T16:17:25+00:00
Next Scan 2025-10-05T16:17:25+00:00

Last Scan

Scanned2025-09-28T16:17:25+00:00
URL https://canligaste.com/robots.txt
Domain IPs 104.21.85.108, 172.67.204.161, 2606:4700:3031::6815:556c, 2606:4700:3033::ac43:cca1
Response IP 104.21.85.108
Found Yes
Hash 2e11902ce95f1961b057e5ea4f80949eba51366ef036e624a7c2a459e1afe094
SimHash a80575444512

Groups

*

Rule Path
Disallow /arama/*
Disallow /pll.php?AnketId=*

Other Records

Field Value
sitemap https://www.canligaste.com/sitemap-seo/sitemap-seo-index.xml
sitemap https://www.canligaste.com/news-sitemap/news-sitemap-tumu.xml
sitemap https://www.canligaste.com/news-yandex/yandex-news.xml
sitemap https://www.canligaste.com/namaz-sitemap/namaz-sitemap.xml
sitemap https://www.canligaste.com/n-havadurumu-sitemap/hava-sitemap.xml
sitemap https://www.canligaste.com/websub-rss.xml
sitemap https://www.canligaste.com/sitemap_1.xml
sitemap https://www.canligaste.com/sitemap.xml