SEO report of scraping-book.com

Python クローリング&スクレイピング -データ収集・解析のための実践開発ガイド ...

www.scraping-book.com/

Error! The "meta description" is missing, the page has no summary description!


 Tasks

  • Select one version of your site as main and make a redirect from other versions to that one.
  • Avoid using deprecated HTML tags.

 SEO

URL

Domain : www.scraping-book.com/

Character length : 22

Title
Python クローリング&スクレイピング -データ収集・解析のための実践開発ガイド-
Keywords (meta keywords)
Python,クローリング,スクレイピング,Scrapy,Beautiful Soup,lxml,非同期I/O,画像認識,自然言語処理,データ解析,Selenium,PDF/XLSX解析,Web API,グラフ化,OAuth,オープンデータ,BigQuery

Error! Using “meta keywords” is meaningless in a while.
Open Graph Protocol

Good! The OG (Open Graph) protocol is set on this website.

type: books.book
title: Python クローリング&スクレイピング -データ収集・解析のための実践開発ガイド-
url: http://scraping-book.com/
image: http://scraping-book.com/images/book-cover-1200.png
description: Webデータ収集・解析の技法を基礎から実用まで徹底解説したPythonのクローリング・スクレイピング本

Dublin Core
Dublin Core is not used
Underscores in the URLs
Good! No underscore (_) found in the URLs.
Search engine friendly URLs
Good! The website uses SEO friendly URLs.
Checking the robots.txt file
The robots.txt file is missing!

 Social

Social Engagement

No info found.

 Content

Doctype
HTML 5
Encoding
Perfect! The character encoding is set: UTF-8.
Language
We have found the language localisation: ”ja”.
Title
Python クローリング&スクレイピング -データ収集・解析のための実践開発ガイド-

Character length : 47

Good! The title’s length is between 10 and 70 characters.
Text / HTML ratio
Ratio : 19%

Acceptable! The text / code ratio is between 15 and 25 percent.
Headings
H1H2H3H4H5H6
116800
Heading structure in the source code
  • <H1> Python クローリング&スクレイピング
  • <H2> データ収集・解析のための実践開発ガイド
  • <H3> 内容紹介
  • <H3> 書籍情報
  • <H3> 販売サイト
  • <H3> 著者ブログ
  • <H3> レビュー記事
  • <H3> 目次
  • <H4> 第1章 クローリング・スクレイピングとは何か
  • <H4> 第2章 Pythonではじめるクローリング・スクレイピング
  • <H4> 第3章 強力なライブラリの活用
  • <H4> 第4章 実用のためのメソッド
  • <H4> 第5章 クローリング・スクレイピングの実践とデータの活用
  • <H4> 第6章 フレームワーク Scrapy
  • <H4> 第7章 クローラーの継続的な運用・管理
  • <H4> Appendix Vagrantによる開発環境の構築
Word cloud
  • まとめ7
  • ツイート2
Keyword matrix
wordtitledescriptionsheading
まとめ
ツイート
404 Page
The website has a 404 error page.
Flash content
Good! The website does not have any flash contents.
Frame
Good! The website does not use iFrame solutions.
Images
We found 1 images on this web page.

Good! Every image has an alternative text attributes set on this website.

 Technologies

Deprecated HTML elements
Good! No deprecated HTML tags are detected.
Redirection (www / not www)
Error! The web address is accessible with and without www!
Deprecated HTML elements
Good! No deprecated HTML tags are detected.
Printability
Suggestion! Unfortunately, no printer-friendly CSS found.
Meta Tag (viewport tag, mobile devices)
Error! The meta tag named viewport is missing.

 Speed test

Server response time
The server response time is fast enough.
Table layout
Good! No nested tables found.
Number of HTTP resources
26
Number of source domains
9
Render blocking resources
The elements below are blocking the “above the fold” rendering.
List of render blocking css files
  • http://www.scraping-book.com/css/style.css

 Speed test – Javascript

Javascript
Good! Just a few javascript files are detected on the website.
  • http://www.scraping-book.com/js/top.js
File size of all javascript files combined
950.50KB
Javascript minifying
You can save 159B (41% compression) on the analysed URL by minifying the javascript files.

 Speed test – CSS

CSS
Good! Just a few CSS files are used on this website.
  • http://www.scraping-book.com/css/style.css
File size of all css files combined
7.41KB
CSS minifying
You can save 395B (20% compression) on the analysed URL by minifying the CSS files.

 Speed test – Compression

Uncompressed size of the of the HTML
232.93KB
Gzip compression
Your site uses compression.

 Speed test – Browser cache

Number of static resources (image, JS, CSS)
15
Browser cache
The browser cache is not set correctly for all elements.
URLDuration
http://www.scraping-book.com/css/style.cssExpiry time is not specified
http://www.scraping-book.com/images/bg@650w.jpgExpiry time is not specified
http://www.scraping-book.com/images/book_image@2x.jpgExpiry time is not specified
http://www.scraping-book.com/images/icon_medal.pngExpiry time is not specified
http://www.scraping-book.com/images/orangain_logo.pngExpiry time is not specified
http://www.scraping-book.com/images/quote-left.svgExpiry time is not specified
http://www.scraping-book.com/js/top.jsExpiry time is not specified
https://syndication.twitter.com/settings10 minutes
http://connect.facebook.net/ja_JP/sdk.js20 minutes
http://platform.twitter.com/widgets.js30 minutes
https://www.google-analytics.com/analytics.js2 hours

 Speed test – Images

File size of all images combined
436.49KB
Image optimisation
You can save 211.7KB (50% compression) by optimising the images below:

 Links

We found a total of 11 different links.
Internal links: 1
External links: 10

External links:

Link text (anchor) Link strength

Internal links:

Link text (anchor) Link strength

 Website security

IP
219.94.162.21
External hidden links
Good! No hidden external links found
Looking for eval()
Good! No eval(bas64_decode()) scripts are found
Checking for XSS vulnerability
No XSS vulnerability found
Email encryption
Good! We have not found any unencrypted email addresses.

 Sites on same ip

iphone7-order.com

scraping-book.com

travel-netyoyaku.com

gothealthywithjuiceplus.com

osakahost.net

giraffeworks.com

grandparentsmagazine.net

falkor-system.com

stereo2software.com

nailjewel.jp

 Icons

Favicon
Good! The website uses favicon.

 Typos

craping-book.com, sqcraping-book.com, qcraping-book.com, swcraping-book.com, wcraping-book.com, secraping-book.com, ecraping-book.com, szcraping-book.com, zcraping-book.com, sxcraping-book.com, xcraping-book.com, sccraping-book.com, ccraping-book.com, sraping-book.com, scxraping-book.com, sxraping-book.com, scsraping-book.com, ssraping-book.com, scraping-book.com, sraping-book.com, scdraping-book.com, sdraping-book.com, scfraping-book.com, sfraping-book.com, scvraping-book.com, svraping-book.com, sc raping-book.com, s raping-book.com, scaping-book.com, screaping-book.com, sceaping-book.com, scrdaping-book.com, scdaping-book.com, scrfaping-book.com, scfaping-book.com, scrgaping-book.com, scgaping-book.com, scr4,aping-book.com, sc4,aping-book.com, scrtaping-book.com, sctaping-book.com, scr5aping-book.com, sc5aping-book.com, scrping-book.com, scraqping-book.com, scrqping-book.com, scrawping-book.com, scrwping-book.com, scrazping-book.com, scrzping-book.com, scraping-book.com, scrping-book.com, scraxping-book.com, scrxping-book.com, scrasping-book.com, scrsping-book.com, scraing-book.com, scrapoing-book.com, scraoing-book.com, scrapling-book.com, scraling-book.com, scrap0ing-book.com, scra0ing-book.com, scrap-ing-book.com, scra-ing-book.com, scraping-book.com, scraing-book.com, scrap_ing-book.com, scra_ing-book.com, scrapng-book.com, scrapiung-book.com, scrapung-book.com, scrapijng-book.com, scrapjng-book.com, scraping-book.com, scrapng-book.com, scrapilng-book.com, scraplng-book.com, scrapiong-book.com, scrapong-book.com, scrapi8ng-book.com, scrap8ng-book.com, scrapi9ng-book.com, scrap9ng-book.com, scrapi*ng-book.com, scrap*ng-book.com, scrapig-book.com, scrapinbg-book.com, scrapibg-book.com, scrapingg-book.com, scrapigg-book.com, scrapinhg-book.com, scrapihg-book.com, scrapinjg-book.com, scrapijg-book.com, scrapinmg-book.com, scrapimg-book.com, scrapin g-book.com, scrapi g-book.com, scrapin-book.com, scrapingr-book.com, scrapinr-book.com, scrapingf-book.com, scrapinf-book.com, scrapingv-book.com, scrapinv-book.com, scrapingc-book.com, scrapinc-book.com, scrapingb-book.com, scrapinb-book.com, scrapingy-book.com, scrapiny-book.com, scrapingh-book.com, scrapinh-book.com, scrapingn-book.com, scrapinn-book.com, scrapingbook.com, scraping-=book.com, scraping=book.com, scraping-_book.com, scraping_book.com, scraping-0book.com, scraping0book.com, scraping-+book.com, scraping+book.com, scraping-*book.com, scraping*book.com, scraping-9book.com, scraping9book.com, scraping-ook.com, scraping-bvook.com, scraping-vook.com, scraping-bfook.com, scraping-fook.com, scraping-bgook.com, scraping-gook.com, scraping-book.com, scraping-ook.com, scraping-bhook.com, scraping-hook.com, scraping-bnook.com, scraping-nook.com, scraping-b ook.com, scraping- ook.com, scraping-bok.com, scraping-boiok.com, scraping-biok.com, scraping-bokok.com, scraping-bkok.com, scraping-bolok.com, scraping-blok.com, scraping-book.com, scraping-bok.com, scraping-bopok.com, scraping-bpok.com, scraping-bo9ok.com, scraping-b9ok.com, scraping-bo0ok.com, scraping-b0ok.com, scraping-bok.com, scraping-booik.com, scraping-boik.com, scraping-bookk.com, scraping-bokk.com, scraping-boolk.com, scraping-bolk.com, scraping-book.com, scraping-bok.com, scraping-boopk.com, scraping-bopk.com, scraping-boo9k.com, scraping-bo9k.com, scraping-boo0k.com, scraping-bo0k.com

More Sites

  • Title: Hilary Kinney - Home
  • Description:
  • Sites loading time: 1744
  • Internet Protocol (IP) address: 72.34.53.253
  • Javascript total size: 156.02KB
  • CSS total size: 8.22KB
  • Image total size: 3.31KB
  • Total size: 170.58KB
  • Tech:
    • Other
      • CSS (Cascading Style Sheets)
      • Html (HyperText Markup Language)
      • Html5
      • Javascript
      • Php (Hypertext Preprocessor)
  • Title: Home - Global Hotels & Resorts
  • Description: Site global-hotels.com
  • Sites loading time: 7307
  • Internet Protocol (IP) address: 188.120.33.98
  • Javascript total size: 68.38KB
  • CSS total size: 22.40KB
  • Image total size: 303.00KB
  • Total size: 492.94KB
  • Tech:
    • Analytic
      • Google Analytics
    • Other
      • CSS (Cascading Style Sheets)
      • Html (HyperText Markup Language)
      • Javascript
      • Php (Hypertext Preprocessor)
  • Title: Froschnet
  • Description:
  • Sites loading time: 3006
  • Internet Protocol (IP) address: 87.230.14.165
  • Javascript total size: 115.94KB
  • CSS total size: 73.56KB
  • Image total size: 4.16MB
  • Total size: 4.52MB
  • Tech:
    • CMS
      • Wordpress CMS
    • Other
      • CSS (Cascading Style Sheets)
      • Google Font API
      • Html (HyperText Markup Language)
      • Html5
      • Javascript
      • jQuery
      • Php (Hypertext Preprocessor)
      • Pingback
      • SVG (Scalable Vector Graphics)
      • Swf Object
  • Title: Worldsoft AG CMS website
  • Description:
  • Sites loading time: 1513
  • Internet Protocol (IP) address: 217.196.177.100
  • Javascript total size: 530.96KB
  • CSS total size: 293.80KB
  • Image total size: 245.64KB
  • Total size: 1.14MB
  • Tech:
    • CMS
      • Xoops CMS
    • Social
      • Add This
    • Other
      • CSS (Cascading Style Sheets)
      • Font Awesome
      • Html (HyperText Markup Language)
      • Html5
      • Iframe
      • Javascript
      • jQuery UI
      • Php (Hypertext Preprocessor)
  • Title: Homeópata en Madrid, Doctor Luis Díaz Vidal
  • Description: Consulta de homeopatía del doctor Díaz Vidal en Madrid. En la calle Goya y en Las Rozas de Madrid
  • Sites loading time: 1554
  • Internet Protocol (IP) address: 217.160.230.108
  • Javascript total size: 314.71KB
  • CSS total size: 59.85KB
  • Image total size: 0.99MB
  • Total size: 1.39MB
  • Tech:
    • CMS
      • Wordpress CMS
    • Analytic
      • Google Analytics
    • Social
      • Twitter Button
    • Other
      • CSS (Cascading Style Sheets)
      • Flexslider
      • Google Font API
      • Html (HyperText Markup Language)
      • Html5
      • Javascript
      • jQuery
      • Php (Hypertext Preprocessor)
      • Pingback
  • Title: Homevalueappraisal.com
  • Description:
  • Sites loading time: 464
  • Internet Protocol (IP) address: 208.91.197.27
  • Javascript total size: 0.00B
  • CSS total size: 0.00B
  • Image total size: 0.00B
  • Total size: 272.00B
  • Tech:
    • Other
      • CSS (Cascading Style Sheets)
      • Html (HyperText Markup Language)
      • Javascript
      • Php (Hypertext Preprocessor)
  • Title: ali:cia design
  • Description:
  • Sites loading time: 388
  • Internet Protocol (IP) address: 46.30.215.5
  • Javascript total size: 0.00B
  • CSS total size: 0.00B
  • Image total size: 190.18KB
  • Total size: 191.46KB
  • Tech:
    • Other
      • CSS (Cascading Style Sheets)
      • Html (HyperText Markup Language)
  • Title: yffurcapital.com
  • Description:
  • Sites loading time: 198
  • Internet Protocol (IP) address: 85.233.160.24
  • Javascript total size: 0.00B
  • CSS total size: 0.00B
  • Image total size: 0.00B
  • Total size: 444.00B
  • Tech:
    • Other
      • CSS (Cascading Style Sheets)
      • Html (HyperText Markup Language)
      • Html5
      • Iframe
  • Title: Business Druid — Improve Your Business Image
  • Description:
  • Internet Protocol (IP) address: 75.126.137.82
  • Tech:
    • Other
      • CSS (Cascading Style Sheets)
      • Html (HyperText Markup Language)
  • Title: Monson Flooring | Commercial Flooring Repair and Removal
  • Description:
  • Internet Protocol (IP) address: 69.195.124.226
  • Tech:
    • CMS
      • Wordpress CMS
    • Other
      • CSS (Cascading Style Sheets)
      • Google Font API
      • Html (HyperText Markup Language)
      • Html5
      • Iframe
      • Javascript
      • jQuery
      • Php (Hypertext Preprocessor)
      • Pingback
      • Shortcodes