mashableonline.com
robots.txt

Robots Exclusion Standard data for mashableonline.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	mashableonline.com
Base Domain	mashableonline.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2024-07-16T23:22:49+00:00
Next Scan	2024-10-14T23:22:49+00:00

Last Successful Scan

Scanned	2024-02-25T23:13:28+00:00
URL	https://mashableonline.com/robots.txt
Domain IPs	68.65.123.97
Response IP	68.65.123.97
Found	Yes
Hash	28aab775235471c3f1615b25a2ba7ebbcd00637d27a8408020335f2e9290d19c
SimHash	480499717ea2

Groups

*

Rule	Path
Disallow	/wp-admin/
Disallow	/wp-content/uploads/wpo-plugins-tables-list.json
Allow	/wp-admin/admin-ajax.php

Rule

Path

Disallow

/wp-admin/

Disallow

/wp-content/uploads/wpo-plugins-tables-list.json

Allow

/wp-admin/admin-ajax.php

googlebot

Rule	Path
Disallow

Rule

Path

Disallow

mediapartners-google*

Rule	Path
Disallow

Rule

Path

Disallow

googlebot-image

Rule	Path
Disallow

Rule

Path

Disallow

adsbot-google

Rule	Path
Disallow

Rule

Path

Disallow

adsbot-google-mobile

Rule	Path
Disallow

Rule

Path

Disallow

googlebot-mobile

Rule	Path
Disallow

Rule

Path

Disallow

googlebot-news

Rule	Path
Disallow

Rule

Path

Disallow

bingbot

Rule	Path
Disallow

Rule

Path

Disallow

msnbot

Rule	Path
Disallow

Rule

Path

Disallow

slurp

Rule	Path
Disallow

Rule

Path

Disallow

duckduckbot

Rule	Path
Disallow

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow

Rule

Path

Disallow

yandexbot

Rule	Path
Disallow

Rule

Path

Disallow

ia_archiver

Rule	Path
Disallow

Rule

Path

Disallow

teoma

Rule	Path
Disallow

Rule

Path

Disallow

rogerbot

Rule	Path
Disallow

Rule

Path

Disallow

rogerbot/1.2

Rule	Path
Disallow

Rule

Path

Disallow

dotbot

Rule	Path
Disallow

Rule

Path

Disallow

dotbot/1.1

Rule	Path
Disallow

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow

Rule

Path

Disallow

ninjabot

Rule	Path
Disallow

Rule

Path

Disallow

facebot

Rule	Path
Disallow

Rule

Path

Disallow

twitterbot

Rule	Path
Disallow

Rule

Path

Disallow

linkedinbot

Rule	Path
Disallow

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://www.mashableonline.com/sitemap_index.xml
sitemap	https://www.mashableonline.com/post-sitemap.xml
sitemap	https://www.mashableonline.com/page-sitemap.xml
sitemap	https://www.mashableonline.com/category-sitemap.xml
sitemap	https://www.mashableonline.com/author-sitemap.xml

Field

Value

sitemap

https://www.mashableonline.com/sitemap_index.xml

sitemap

https://www.mashableonline.com/post-sitemap.xml

sitemap

https://www.mashableonline.com/page-sitemap.xml

sitemap

https://www.mashableonline.com/category-sitemap.xml

sitemap

https://www.mashableonline.com/author-sitemap.xml

Comments

Adding Multiple Sitemaps
Allowed Good User Agents for better Crawl

mashableonline.comrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

googlebot

mediapartners-google*

googlebot-image

adsbot-google

adsbot-google-mobile

googlebot-mobile

googlebot-news

bingbot

msnbot

slurp

duckduckbot

baiduspider

yandexbot

ia_archiver

teoma

rogerbot

rogerbot/1.2

dotbot

dotbot/1.1

ahrefsbot

mj12bot

semrushbot

ninjabot

facebot

twitterbot

linkedinbot

Other Records

Comments

mashableonline.com
robots.txt