slashdotblog.com
robots.txt

Robots Exclusion Standard data for slashdotblog.com

Resource Scan

Scan Details

Site Domain	slashdotblog.com
Base Domain	slashdotblog.com
Scan Status	Ok
Last Scan	2024-09-25T17:42:50+00:00
Next Scan	2024-10-25T17:42:50+00:00

Last Scan

Scanned	2024-09-25T17:42:50+00:00
URL	https://slashdotblog.com/robots.txt
Domain IPs	199.188.200.112
Response IP	199.188.200.112
Found	Yes
Hash	474900cb644845060d9561fcc4af6382d79f0ebb42e7d0aacba4e58bfda8125b
SimHash	0234c360ffe0

Groups

*

Product	Comment
*	applies to all robots

Rule	Path
Disallow	/*/feed/$
Disallow	/?nonamp=1%2F
Disallow	/?amp=1
Disallow	/?noamp=mobile
Disallow	/wp-admin/
Allow	/wp-admin/admin-ajax.php
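
The Allow rule for /wp-admin/admin-ajax.php coexists with the broader Disallow for /wp-admin/ because, under RFC 9309, the longest matching pattern wins and Allow wins a tie. The following is a minimal Python sketch of that precedence for the rules listed above. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and the sketch skips percent-encoding normalization, so it is an illustration rather than a complete matcher.

```python
import re

# Rules copied from the `*` group in the table above.
RULES = [
    ("disallow", "/*/feed/$"),
    ("disallow", "/?nonamp=1%2F"),
    ("disallow", "/?amp=1"),
    ("disallow", "/?noamp=mobile"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    `*` matches any run of characters; a trailing `$` anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched: longest match wins, Allow wins ties."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow
```

With these rules, `is_allowed("/wp-admin/admin-ajax.php")` returns True while `is_allowed("/wp-admin/options.php")` returns False, and `/*/feed/$` blocks `/category/feed/` but not `/category/feed/page`.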

googlebot

Rule	Path
Disallow
Allow	/*

mediapartners-google*

Rule	Path
Disallow
Allow	/*

boomtrain-content-bot*

Rule	Path
Disallow
Allow	/*

googlebot-image

Rule	Path
Disallow
Allow	/*

adsbot-google

Rule	Path
Disallow
Allow	/*

googlebot-news

Rule	Path
Disallow
Allow	/*

bingbot

Rule	Path
Disallow
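
A Disallow rule with an empty path, as in this group and several others in the scan, blocks nothing, so the named crawler may fetch every URL. The sketch below checks that reading with Python's standard `urllib.robotparser`. The robots.txt lines are a reconstruction from the table above, not the site's actual file, and `examplebot` and the `/some-post/` URL are hypothetical values added only for contrast.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment: the bingbot group mirrors the table above;
# "examplebot" is a hypothetical agent that is blocked entirely.
ROBOTS_LINES = [
    "User-agent: bingbot",
    "Disallow:",
    "",
    "User-agent: examplebot",
    "Disallow: /",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Hypothetical URL used only to exercise the rules.
url = "https://slashdotblog.com/some-post/"
bing_ok = parser.can_fetch("bingbot", url)        # empty Disallow: nothing blocked
example_ok = parser.can_fetch("examplebot", url)  # "Disallow: /": everything blocked
```

Here `bing_ok` is True and `example_ok` is False, which is the difference between an empty Disallow and `Disallow: /`.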

msnbot

Rule	Path
Disallow

slurp

Rule	Path
Disallow

duckduckbot

Rule	Path
Disallow

baiduspider

Rule	Path
Disallow

yandexbot

Rule	Path
Disallow

ia_archiver

Rule	Path
Disallow

teoma

Rule	Path
Disallow

rogerbot

Rule	Path
Disallow

rogerbot/1.2

Rule	Path
Disallow

dotbot

Rule	Path
Disallow

dotbot/1.1

Rule	Path
Disallow

ahrefsbot

Rule	Path
Disallow
Allow	/*

mj12bot

Rule	Path
Disallow
Allow	/*

semrushbot

Rule	Path
Disallow
Allow	/*

ninjabot

Rule	Path
Disallow

facebot

Rule	Path
Disallow

twitterbot

Rule	Path
Disallow

linkedinbot

Rule	Path
Disallow

Comments

Adding Multiple Sitemaps
Allowed Good User Agents for better Crawl

Warnings

`https` is not a known field.
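
The scan does not show the line that triggered this warning. One plausible cause, offered only as an assumption, is a bare sitemap URL written without a `Sitemap:` prefix: a parser that splits each line at the first colon would then read `https` as the field name. The sketch below illustrates that behavior; `split_field`, `KNOWN_FIELDS`, and the sitemap URL are hypothetical and are not taken from the scanned file.

```python
# Fields a simple checker might recognize (assumed list, for illustration).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def split_field(line):
    """Split a robots.txt line into (field, value) at the first colon.

    Comments starting with `#` are dropped; returns None if no colon remains.
    """
    line = line.split("#", 1)[0].strip()
    if ":" not in line:
        return None
    field, value = line.split(":", 1)
    return field.strip().lower(), value.strip()


# Hypothetical lines: the first lacks the `Sitemap:` prefix, the second has it.
bare = split_field("https://slashdotblog.com/sitemap.xml")
prefixed = split_field("Sitemap: https://slashdotblog.com/sitemap.xml")

bare_is_known = bare[0] in KNOWN_FIELDS          # field name is "https"
prefixed_is_known = prefixed[0] in KNOWN_FIELDS  # field name is "sitemap"
```

If that is indeed the cause, prefixing each sitemap URL with `Sitemap:` would make the line parse as a known field.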
