URL的结构

资源:
- RFC1808_中文 2007版
- w3c英文版

RFC 1808 Relative Uniform Resource Locators June 1995
格式: <scheme>://<net_loc>/<path>;<params>?<query>#<fragment>
each of which, except , may be absent from a particular URL.
These components are defined as follows (a complete BNF is provided
in Section 2.2):

  scheme ":"   ::= scheme name, as per Section 2.1 of RFC 1738 [2].

  "//" net_loc ::= network location and login information, as per
                   Section 3.1 of RFC 1738 [2].

  "/" path     ::= URL path, as per Section 3.1 of RFC 1738 [2].

  ";" params   ::= object parameters (e.g., ";type=a" as in
                   Section 3.2.2 of RFC 1738 [2]).

  "?" query    ::= query information, as per Section 3.3 of
                   RFC 1738 [2].

  "#" fragment ::= fragment identifier.

The following are two example URIs and their component parts:

     foo://example.com:8042/over/there?name=ferret#nose
     \_/   \______________/\_________/ \_________/ \__/
      |           |            |            |        |
   scheme     authority       path        query   fragment
      |   _____________________|__
     / \ /                        \
     urn:example:animal:ferret:nose

python中有一个urllib.parse库,可以用来解析url。#��

版权声明:本文为博主原创文章,未经博主允许不得转载。

posted on 2015-07-26 09:16  Broven  阅读(125)  评论(0编辑  收藏  举报

导航