Sphinx + GitHub + ReadtheDocs 创建电子书

Read the Docs是一个在线文档托管服务, 你可以从各种版本控制系统中导入文档。一般来讲,这个非常适合写软件文档以及编写一些教程、电子书之类。

Sphinx

Sphinx 是一个基于 Python 的文档生成项目,最早只是用来生成 Python 官方文档,随着工具的完善, 越来越多的知名的项目也用他来生成文档,甚至完全可以用他来写书采用了 reStructuredText 作为文档写作语言, 不过也可以通过模块支持其他格式。

安装Sphinx

pip install sphinx sphinx-autobuild sphinx_rtd_theme

初始化

# 创建文档根目录
mkdir -p /root/work/scrapy-cookbook
cd scrapy-cookbook/
# 可以回车按默认配置来写
sphinx-quickstart

初始化时基本选择默认的即可,有几点需要注意的:

> Separate source and build directories (y/n) [n]:y
> Project name: scrapy-cookbook
> Author name(s): Name
> Project version []: 1.0
> Project release [1.0]: 1.0
> Project language [en]: zh_CN

添加一篇文章

新建 hello.rst,内容如下:

hello,world
=============

ndex.rst 修改如下:

Contents:
.. toctree::
   :maxdepth: 2

   hello

注意,空行。

更改主题 sphinx_rtd_theme

更改 conf.py:

import sphinx_rtd_theme
html_theme = "sphinx_rtd_theme"
html_theme_path = [sphinx_rtd_theme.get_html_theme_path()]

预览效果

然后在目录中执行:

make html

进入 build/html 目录后用浏览器打开index.html。

支持 markdown 编写

通过 recommonmark 来支持 markdown

pip install recommonmark

然后更改 conf.py:

from recommonmark.parser import CommonMarkParser
source_parsers = {
    '.md': CommonMarkParser,
}
source_suffix = ['.rst', '.md']

AutoStructify

如果想使用高级功能,可以添加 AutoStructify 配置,在 conf.py 中添加:

# At top on conf.py (with other import statements)
import recommonmark
from recommonmark.transform import AutoStructify

# At the bottom of conf.py
def setup(app):
    app.add_config_value('recommonmark_config', {
            'url_resolver': lambda url: github_doc_root + url,
            'auto_toc_tree_section': 'Contents',
            }, True)
    app.add_transform(AutoStructify)

可以直接使用下面的配置:

# -*- coding: utf-8 -*-
#
# Recommonmark documentation build configuration file, created by
# sphinx-quickstart on Tue Jul 28 11:17:27 2015.
#
# This file is execfile()d with the current directory set to its
# containing dir.
#
# Note that not all possible configuration values are present in this
# autogenerated file.
#
# All configuration values have a default; values that are commented out
# serve to show the default.

import sys
import os
import shlex

# If extensions (or modules to document with autodoc) are in another directory,
# add these directories to sys.path here. If the directory is relative to the
# documentation root, use os.path.abspath to make it absolute, like shown here.
sys.path.insert(0, os.path.abspath('..'))
import recommonmark
from recommonmark.transform import AutoStructify


source_suffix = ['.rst', '.md']

# -- General configuration ------------------------------------------------

# If your documentation needs a minimal Sphinx version, state it here.
#needs_sphinx = '1.0'

# Add any Sphinx extension module names here, as strings. They can be
# extensions coming with Sphinx (named 'sphinx.ext.*') or your custom
# ones.
extensions = [
    'sphinx.ext.autodoc',
    'sphinx.ext.napoleon',
    'sphinx.ext.mathjax',
    'recommonmark',
]

# Add any paths that contain templates here, relative to this directory.
templates_path = ['_templates']

# The encoding of source files.
#source_encoding = 'utf-8-sig'

# The master toctree document.
master_doc = 'index'

# General information about the project.
project = u'Recommonmark'
copyright = u'2015, Luca Barbato, Eric Holscher, and contributors'
author = u'Luca Barbato, Eric Holscher, and contributors'

github_doc_root = 'https://github.com/rtfd/recommonmark/tree/master/docs/'

# The version info for the project you're documenting, acts as replacement for
# |version| and |release|, also used in various other places throughout the
# built documents.
#
# The short X.Y version.
version = recommonmark.__version__
# The full version, including alpha/beta/rc tagss
release = recommonmark.__version__

# The language for content autogenerated by Sphinx. Refer to documentation
# for a list of supported languages.
#
# This is also used if you do content translation via gettext catalogs.
# Usually you set "language" from the command line for these cases.
language = None

# There are two options for replacing |today|: either, you set today to some
# non-false value, then it is used:
#today = ''
# Else, today_fmt is used as the format for a strftime call.
#today_fmt = '%B %d, %Y'

# List of patterns, relative to source directory, that match files and
# directories to ignore when looking for source files.
exclude_patterns = ['_build']

# The reST default role (used for this markup: `text`) to use for all
# documents.
default_role = None

# If true, '()' will be appended to :func: etc. cross-reference text.
#add_function_parentheses = True

# If true, the current module name will be prepended to all description
# unit titles (such as .. function::).
#add_module_names = True

# If true, sectionauthor and moduleauthor directives will be shown in the
# output. They are ignored by default.
#show_authors = False

# The name of the Pygments (syntax highlighting) style to use.
pygments_style = 'sphinx'

# A list of ignored prefixes for module index sorting.
#modindex_common_prefix = []

# If true, keep warnings as "system message" paragraphs in the built documents.
#keep_warnings = False

# If true, `todo` and `todoList` produce output, else they produce nothing.
todo_include_todos = False


# -- Options for HTML output ----------------------------------------------

# The theme to use for HTML and HTML Help pages.  See the documentation for
# a list of builtin themes.
html_theme = 'sphinx_rtd_theme'

# Theme options are theme-specific and customize the look and feel of a theme
# further.  For a list of options available for each theme, see the
# documentation.
#html_theme_options = {}

# Add any paths that contain custom themes here, relative to this directory.
#html_theme_path = []

# The name for this set of Sphinx documents.  If None, it defaults to
# "<project> v<release> documentation".
#html_title = None

# A shorter title for the navigation bar.  Default is the same as html_title.
#html_short_title = None

# The name of an image file (relative to this directory) to place at the top
# of the sidebar.
#html_logo = None

# The name of an image file (within the static path) to use as favicon of the
# docs.  This file should be a Windows icon file (.ico) being 16x16 or 32x32
# pixels large.
#html_favicon = None

# Add any paths that contain custom static files (such as style sheets) here,
# relative to this directory. They are copied after the builtin static files,
# so a file named "default.css" will overwrite the builtin "default.css".
html_static_path = ['_static']

# Add any extra paths that contain custom files (such as robots.txt or
# .htaccess) here, relative to this directory. These files are copied
# directly to the root of the documentation.
#html_extra_path = []

# If not '', a 'Last updated on:' timestamp is inserted at every page bottom,
# using the given strftime format.
#html_last_updated_fmt = '%b %d, %Y'

# If true, SmartyPants will be used to convert quotes and dashes to
# typographically correct entities.
#html_use_smartypants = True

# Custom sidebar templates, maps document names to template names.
#html_sidebars = {}

# Additional templates that should be rendered to pages, maps page names to
# template names.
#html_additional_pages = {}

# If false, no module index is generated.
#html_domain_indices = True

# If false, no index is generated.
#html_use_index = True

# If true, the index is split into individual pages for each letter.
#html_split_index = False

# If true, links to the reST sources are added to the pages.
#html_show_sourcelink = True

# If true, "Created using Sphinx" is shown in the HTML footer. Default is True.
#html_show_sphinx = True

# If true, "(C) Copyright ..." is shown in the HTML footer. Default is True.
#html_show_copyright = True

# If true, an OpenSearch description file will be output, and all pages will
# contain a <link> tag referring to it.  The value of this option must be the
# base URL from which the finished HTML is served.
#html_use_opensearch = ''

# This is the file name suffix for HTML files (e.g. ".xhtml").
#html_file_suffix = None

# Language to be used for generating the HTML full-text search index.
# Sphinx supports the following languages:
#   'da', 'de', 'en', 'es', 'fi', 'fr', 'hu', 'it', 'ja'
#   'nl', 'no', 'pt', 'ro', 'ru', 'sv', 'tr'
#html_search_language = 'en'

# A dictionary with options for the search language support, empty by default.
# Now only 'ja' uses this config value
#html_search_options = {'type': 'default'}

# The name of a javascript file (relative to the configuration directory) that
# implements a search results scorer. If empty, the default will be used.
#html_search_scorer = 'scorer.js'

# Output file base name for HTML help builder.
htmlhelp_basename = 'Recommonmarkdoc'

# -- Options for LaTeX output ---------------------------------------------

latex_elements = {
# The paper size ('letterpaper' or 'a4paper').
#'papersize': 'letterpaper',

# The font size ('10pt', '11pt' or '12pt').
#'pointsize': '10pt',

# Additional stuff for the LaTeX preamble.
#'preamble': '',

# Latex figure (float) alignment
#'figure_align': 'htbp',
}

# Grouping the document tree into LaTeX files. List of tuples
# (source start file, target name, title,
#  author, documentclass [howto, manual, or own class]).
latex_documents = [
  (master_doc, 'Recommonmark.tex', u'Recommonmark Documentation',
   u'Lu Zero, Eric Holscher, and contributors', 'manual'),
]

# The name of an image file (relative to this directory) to place at the top of
# the title page.
#latex_logo = None

# For "manual" documents, if this is true, then toplevel headings are parts,
# not chapters.
#latex_use_parts = False

# If true, show page references after internal links.
#latex_show_pagerefs = False

# If true, show URL addresses after external links.
#latex_show_urls = False

# Documents to append as an appendix to all manuals.
#latex_appendices = []

# If false, no module index is generated.
#latex_domain_indices = True


# -- Options for manual page output ---------------------------------------

# One entry per manual page. List of tuples
# (source start file, name, description, authors, manual section).
man_pages = [
    (master_doc, 'recommonmark', u'Recommonmark Documentation',
     [author], 1)
]

# If true, show URL addresses after external links.
#man_show_urls = False


# -- Options for Texinfo output -------------------------------------------

# Grouping the document tree into Texinfo files. List of tuples
# (source start file, target name, title, author,
#  dir menu entry, description, category)
texinfo_documents = [
  (master_doc, 'Recommonmark', u'Recommonmark Documentation',
   author, 'Recommonmark', 'One line description of project.',
   'Miscellaneous'),
]

# Documents to append as an appendix to all manuals.
#texinfo_appendices = []

# If false, no module index is generated.
#texinfo_domain_indices = True

# How to display URL addresses: 'footnote', 'no', or 'inline'.
#texinfo_show_urls = 'footnote'

# If true, do not generate a @detailmenu in the "Top" node's menu.
#texinfo_no_detailmenu = False

# app setup hook
def setup(app):
    app.add_config_value('recommonmark_config', {
        #'url_resolver': lambda url: github_doc_root + url,
        'auto_toc_tree_section': 'Contents',
        'enable_math': False,
        'enable_inline_math': False,
        'enable_eval_rst': True,
        'enable_auto_doc_ref': True,
    }, True)
    app.add_transform(AutoStructify)

为了正常显示表格需要进行格式转换,这里使用pandoc

pandoc -s -t rst --toc newpage.md -o newpage.rst

可以使用下面的脚本批量转换

#! /usr/bin/env python
#coding=utf-8
import os,sys,re,traceback


class Md2rstConvertor:
    
    #change
    def convert(self,filename):
        lines = []
        isCodeLine = False
        datafile = open(filename,"r")
        for line in datafile:
            # 识别代码段
            m = re.match(r'`{3}[ ]*',line)  
            if m is not None:
                if isCodeLine == False:
                    lines.append('::\n\n')
                    isCodeLine = True
                else:
                    isCodeLine = False # code块结束
                    lines.append('\n')
                continue
            
            if isCodeLine == True:
                lines.append('\t' + line.strip() + '\n')
                continue
            else:
                # 识别一级标题
                m = re.match(r'[ ]*#{1}[^#]+',line)  
                if m is not None:
                    lines.append(line.strip('# \r\n') + '\n')
                    lines.append('=' * len(line) + '\n')
                    continue
                
                # 识别二级标题
                m = re.match(r'[ ]*#{2}[^#]+',line)  
                if m is not None:
                    lines.append(line.strip('# \r\n') + '\n')
                    lines.append('-' * len(line) + '\n')
                    continue
                
                # 识别三级标题
                m = re.match(r'[ ]*#{3}[^#]+',line)  
                if m is not None:
                    lines.append(line.strip('# \r\n') + '\n')
                    lines.append('~' * len(line) + '\n')
                    continue
                
                lines.append(line.strip() + '\n')

            
        
        rstFile = '%s.rst' % filename.split('.')[0]
        fw = open(rstFile,'w')
        for str in lines:
            fw.writelines(str )
        fw.close()
        
        

if __name__ == '__main__':

    convertor = Md2rstConvertor()
    if len(sys.argv) <= 1:
        print ("please input file name to convert :('all' for all file in cur dir)")
        exit()
    
    arg = sys.argv[1]
    if arg == 'all':
        # 转换当前文件夹下所有md文件
        for filename in os.listdir('./'):
            if os.path.isfile(filename): # 过滤文件夹
                fix = filename.split('.')[-1]
                if  fix.upper() == "MD" :
                    convertor.convert(filename)
    else:
        convertor.convert(arg)       

GitHub 托管

一般的做法是将文档托管到版本控制系统比如 github上面,push 源码后自动构建发布到 readthedoc上面, 这样既有版本控制好处,又能自动发布到 readthedoc,实在是太方便了。
具体几个步骤非常简单,参考官方文档:https://github.com/rtfd/readthedocs.org:

  • 在 Read the Docs上面注册一个账号
  • 登陆后点击 “Import”.
  • 给该文档项目填写一个名字比如 “scrapy-cookbook”, 并添加你在 GitHub 上面的工程 HTTPS 链接, 选择仓库类型为 Git
  • 其他项目根据自己的需要填写后点击 “Create”,创建完后会自动去激活 Webhooks,不用再去GitHub设置
posted @ 2020-04-18 22:02  刘-皇叔  阅读(845)  评论(0编辑  收藏  举报