MongoDB开发深入之一：文档数据关系模型详解（一对多，多对多）

文档关联模型通常有3种方式：

嵌入式（一对一、一对多）
后期手动统一ID处理（一对多、多对多）
References引用（一对一、一对多）

文档树模型通常有3种方式：

父引用（Parent References）
子引用（Child References）
祖先数组（Array of Ancestors ）
物化路径（Materialized Paths ）
嵌套_Set，不常用，不详写了

关联模型

1、嵌入式

直接在单独文档中嵌入“子文档”

嵌入一个文档(1对1)：

{
   _id: "joe",
   name: "Joe Bookreader",
   address: {
              street: "123 Fake Street",
              city: "Faketon",
              state: "MA",
              zip: "12345"
            }
}

嵌入多个文档(1对n)，使用“数组”形式：

{
   _id: "joe",
   name: "Joe Bookreader",
   addresses: [
                {
                  street: "123 Fake Street",
                  city: "Faketon",
                  state: "MA",
                  zip: "12345"
                },
                {
                  street: "1 Some Other Street",
                  city: "Boston",
                  state: "MA",
                  zip: "12345"
                }
              ]
 }

这个很简单。

2、后期手动统一ID处理

后期统一ID，用于程序中确立关联，可以处理1对多，也可以处理多对多（通过中间关联表来实现），比如一个典型的多对多中间表，用于确定用户的权限：用户ID-权限ID
使用手动ID，是在另一个文档中包含一个文档 _id字段的做法。然后，应用程序可以发出第二个查询以根据需要的（ID）解析数据。无论是一对一，还是多对多都可用.

original_id = ObjectId()

db.places.insert({
    "_id": original_id,
    "name": "Broadway Center",
    "url": "bc.example.net"
})

db.people.insert({
    "name": "Erin",
    "places_id": original_id,
    "url":  "bc.example.net/Erin"
})

3、References引用

出版商和书的一对一、一对多关联（出版商可以出版很多书）
这个是正确的姿势，确保关联键（publisher_id）必须只有一个值：

publisher：
{
   _id: "oreilly",
   name: "O'Reilly Media",
   founded: 1980,
   location: "CA"
}
books：
{
   _id: 123456789,
   title: "MongoDB: The Definitive Guide",
   author: [ "Kristina Chodorow", "Mike Dirolf" ],
   published_date: ISODate("2010-09-24"),
   pages: 216,
   language: "English",
   publisher_id: "oreilly"
}

{
   _id: 234567890,
   title: "50 Tips and Tricks for MongoDB Developer",
   author: "Kristina Chodorow",
   published_date: ISODate("2011-05-06"),
   pages: 68,
   language: "English",
   publisher_id: "oreilly"
}

错误的引用方式，books的数组持续增长，造成失控，必须使用原子操作来维护数组。

{
   name: "O'Reilly Media",
   founded: 1980,
   location: "CA",
   books: [123456789, 234567890, ...]
}

总结正确的姿势：在父子表中的子表中引用父ID，在这里books为子表，publisher为父表，其实和关系数据库的设计模型是一致的。

树结构

1、父引用（Parent References）

以下的树结构，如图：

数据模型：

db.categories.insert( { _id: "MongoDB", parent: "Databases" } )
db.categories.insert( { _id: "dbm", parent: "Databases" } )
db.categories.insert( { _id: "Databases", parent: "Programming" } )
db.categories.insert( { _id: "Languages", parent: "Programming" } )
db.categories.insert( { _id: "Programming", parent: "Books" } )
db.categories.insert( { _id: "Books", parent: null } )

父引用的含义，顾名思义就是每个子节点指定自己的父亲。

检索节点的父亲的查询：
db.categories.findOne( { _id: "MongoDB" } ).parent

在字段parent上创建索引以启用 parent 节点的快速搜索：
db.categories.createIndex( { parent: 1 } )

通过parent字段查询以查找其直接的子节点：
db.categories.find( { parent: "Databases" } )

要检索子树，可使用$graphLookup。

2、子引用（Child References）

顾名思义就是父亲要标明自己的儿子（数组），感觉模型有些不大合理。最好别沾。

db.categories.insert( { _id: "MongoDB", children: [] } )
db.categories.insert( { _id: "dbm", children: [] } )
db.categories.insert( { _id: "Databases", children: [ "MongoDB", "dbm" ] } )
db.categories.insert( { _id: "Languages", children: [] } )
db.categories.insert( { _id: "Programming", children: [ "Databases", "Languages" ] } )
db.categories.insert( { _id: "Books", children: [ "Programming" ] } )

检索节点的直接 children 的查询：
db.categories.findOne( { _id: "Databases" } ).children
您可以在字段children上创建索引以启用 child 节点的快速搜索：
db.categories.createIndex( { children: 1 } )
您可以在children字段中查询节点以查找其 parent 节点及其兄弟节点：
db.categories.find( { children: "MongoDB" } )

3、完全祖先数组（Array of Ancestors ）

这个比较合理，就是存储父的全路径数组。每个树节点除了父节点，还存储所有的节点的祖先。

db.categories1.insert( { _id: "MongoDB", ancestors: [ "Books", "Programming", "Databases" ], parent: "Databases" } )
db.categories1.insert( { _id: "dbm", ancestors: [ "Books", "Programming", "Databases" ], parent: "Databases" } )
db.categories1.insert( { _id: "Databases", ancestors: [ "Books", "Programming" ], parent: "Programming" } )
db.categories1.insert( { _id: "Languages", ancestors: [ "Books", "Programming" ], parent: "Programming" } )
db.categories1.insert( { _id: "Programming", ancestors: [ "Books" ], parent: "Books" } )
db.categories1.insert( { _id: "Books", ancestors: [ ], parent: null } )

用于检索节点的祖先或路径的查询快速而直接：
db.categories1.findOne( { _id: "MongoDB" } ).ancestors

您可以在字段ancestors上创建索引以启用祖先节点的快速搜索：
db.categories1.createIndex( { ancestors: 1 } )

您可以通过字段ancestors查询以查找其所有后代：
db.categories1.find( { ancestors: "Programming" } )


Array of Ancestors pattern 提供了一种快速有效的解决方案，通过创建祖先字段元素的索引来查找节点的后代和祖先。是开发应用中的好选择。

4、物化路径（Materialized Paths ）

和完全祖先数组原理一样，只是在处理路径时提供了更大的灵活性，例如通过 partial paths 查找节点。

db.categories.insert( { _id: "Books", path: null } )
db.categories.insert( { _id: "Programming", path: ",Books," } )
db.categories.insert( { _id: "Databases", path: ",Books,Programming," } )
db.categories.insert( { _id: "Languages", path: ",Books,Programming," } )
db.categories.insert( { _id: "MongoDB", path: ",Books,Programming,Databases," } )
db.categories.insert( { _id: "dbm", path: ",Books,Programming,Databases," } )

您可以查询以检索整个树，按字段path排序：
db.categories.find().sort( { path: 1 } )
您可以在path字段上使用正则表达式来查找Programming的后代：
db.categories.find( { path: /,Programming,/ } )
您还可以检索Books的后代，其中Books也位于层次结构的最顶层 level：
db.categories.find( { path: /^,Books,/ } )
要在字段path上创建索引，请使用以下调用：
db.categories.createIndex( { path: 1 } )
此索引可能会根据查询提高 performance：
对于来自根Books sub-tree(e.g. /^,Books,/或/^,Books,Programming,/)的查询，path字段上的索引可显着改进查询性能。
对于 sub-trees 的查询，其中查询(e.g. /,Databases,/)中未提供根的路径，或者 sub-trees 的类似查询，其中节点可能位于索引 string 的中间，查询必须检查整个索引。
对于这些查询，如果索引明显小于整个集合，则索引可以提供一些性能改进。

posted @ 2019-12-02 11:02 昕友软件开发阅读(2651) 评论(0) 收藏举报

刷新页面返回顶部

昕友软件开发

知行合一