inegolonline.com
robots.txt

Robots Exclusion Standard data for inegolonline.com

Resource Scan

Scan Details

Site Domain inegolonline.com
Base Domain inegolonline.com
Scan Status Ok
Last Scan2024-11-15T03:45:19+00:00
Next Scan 2024-11-22T03:45:19+00:00

Last Scan

Scanned2024-11-15T03:45:19+00:00
URL https://inegolonline.com/robots.txt
Redirect https://www.inegolonline.com/robots.txt
Redirect Domain www.inegolonline.com
Redirect Base inegolonline.com
Domain IPs 104.22.2.162, 104.22.3.162, 172.67.23.183, 2606:4700:10::6816:2a2, 2606:4700:10::6816:3a2, 2606:4700:10::ac43:17b7
Redirect IPs 104.22.2.162, 104.22.3.162, 172.67.23.183, 2606:4700:10::6816:2a2, 2606:4700:10::6816:3a2, 2606:4700:10::ac43:17b7
Response IP 104.22.2.162
Found Yes
Hash 84d405428ae60f7c05347678ea095a440e92f4b134e7120a4a32ab49e39a48f6
SimHash 4111859f1703

Groups

*
*
mediapartners-google*
yandexnews
googlebot-news
googlebot-image
googlebot-mobile
msnbot
yahoo-mmcrawler
googlebot
baiduspider
yahoo-blogs/v3.9
psbot

Rule Path
Allow /
Allow /amp/*/haber/*/
Allow /*/haber/*/
Allow /tag/*
Disallow /cgi-bin/
Disallow /ajax/
Disallow /embed/
Disallow /json/
Disallow /inc/
Disallow /api/

Other Records

Field Value
sitemap https://www.inegolonline.com/sitemap.xml
sitemap https://www.inegolonline.com/sitemap-inegol-news.xml
sitemap https://www.inegolonline.com/sitemap-galeri.xml
sitemap https://www.inegolonline.com/sitemap-video.xml