https://coderwall.com/p/whf3-a/hierarchical-data-in-postgres

-------------------------------------------------------------------------------

This tip will try to answer the following questions:

  • How can we represent a tree of data in postgres
  • How can we efficiently query for any entire single node and all of it's children (and children's children).

The test data

Since we want to keep this simple we will assume our data is just a bunch of sections. A section just has a nameand each section has a single parent section.

  1. Section A
  2. |--- Section A.1
  3. Section B
  4. |--- Section B.1
  5. |--- Section B.1
  6. |--- Section B.1.1

We'll use this simple data for examples below.

Simple self-referencing

When designing a self-referential table (something that joins itself to itself) the most obvious choice is to have some kind of parent_id column on our table that references itself.

  1. CREATE TABLE section (
  2. id INTEGER PRIMARY KEY,
  3. name TEXT,
  4. parent_id INTEGER REFERENCES section,
  5. );
  6. ALTER TABLE page ADD COLUMN parent_id INTEGER REFERENCES page;
  7. CREATE INDEX section_parent_id_idx ON section (parent_id);

Now insert our example data, using the parent_id to related the nodes together:

  1. INSERT INTO section (id, name, parent_id) VALUES (1, 'Section A', NULL);
  2. INSERT INTO section (id, name, parent_id) VALUES (2, 'Section A.1', 1);
  3. INSERT INTO section (id, name, parent_id) VALUES (3, 'Section B', NULL);
  4. INSERT INTO section (id, name, parent_id) VALUES (4, 'Section B.1', 3);
  5. INSERT INTO section (id, name, parent_id) VALUES (5, 'Section B.2', 3);
  6. INSERT INTO section (id, name, parent_id) VALUES (6, 'Section B.2.1', 5);

This works great for simple queries such as, fetch the direct children of Section B:

  1. SELECT * FROM section WHERE parent = 3

but it will require complex or recursive queries for questions like fetch me all the children and children's children of Section B:

  1. WITH RECURSIVE nodes(id,name,parent_id) AS (
  2. SELECT s1.id, s1.name, s1.parent_id
  3. FROM section s1 WHERE parent_id = 3
  4. UNION
  5. SELECT s2.id, s2.name, s2.parent_id
  6. FROM section s2, nodes s1 WHERE s2.parent_id = s1.id
  7. )
  8. SELECT * FROM nodes;

So we have answered the "how to build a tree" part of the question, but are not happy with the "how to query for a node and all it's children" part.

Enter ltree. (Short for "label tree" - I think?).

The ltree extension

The ltree extension is a great choice for querying hierarchical data. This is especially true for self-referential relationships.

Lets rebuild the above example using ltree. We'll use the page's primary keys as the "labels" within our ltree paths and a special "root" label to denote the top of the tree.

  1. CREATE EXTENSION ltree;
  2. CREATE TABLE section (
  3. id INTEGER PRIMARY KEY,
  4. name TEXT,
  5. parent_path LTREE
  6. );
  7. CREATE INDEX section_parent_path_idx ON section USING GIST (parent_path);

We'll add in our data again, this time rather than using the id for the parent, we will construct an ltree path that represents the parent node.

  1. INSERT INTO section (id, name, parent_path) VALUES (1, 'Section 1', 'root');
  2. INSERT INTO section (id, name, parent_path) VALUES (2, 'Section 1.1', 'root.1');
  3. INSERT INTO section (id, name, parent_path) VALUES (3, 'Section 2', 'root');
  4. INSERT INTO section (id, name, parent_path) VALUES (4, 'Section 2.1', 'root.3');
  5. INSERT INTO section (id, name, parent_path) VALUES (4, 'Section 2.2', 'root.3');
  6. INSERT INTO section (id, name, parent_path) VALUES (5, 'Section 2.2.1', 'root.3.4');

Cool. So now we can make use of ltree's operators @> and <@ to answer our original question like:

  1. SELECT * FROM section WHERE parent_path <@ 'root.3';

However we have introduced a few issues.

  • Our simple parent_id version ensured referential consistancy by making use of the REFERENCES constraint. We lost that by switching to ltree paths.
  • Ensuring that the ltree paths are valid can be a bit of a pain, and if paths become stale for some reason your queries may return unexpected results or you may "orphan" nodes.

The final solution

To fix these issues we want a hybrid of our original parent_id (for the referential consistency and simplicity of the child/parent relationship) and our ltree paths (for improved querying power/indexing). To achieve this we will hide the management of the ltree path behind a trigger and only ever update the parent_id column.

  1. CREATE EXTENSION ltree;
  2. CREATE TABLE section (
  3. id INTEGER PRIMARY KEY,
  4. name TEXT,
  5. parent_id INTEGER REFERENCES section,
  6. parent_path LTREE
  7. );
  8. CREATE INDEX section_parent_path_idx ON section USING GIST (parent_path);
  9. CREATE INDEX section_parent_id_idx ON section (parent_id);
  10. CREATE OR REPLACE FUNCTION update_section_parent_path() RETURNS TRIGGER AS $$
  11. DECLARE
  12. path ltree;
  13. BEGIN
  14. IF NEW.parent_id IS NULL THEN
  15. NEW.parent_path = 'root'::ltree;
  16. ELSEIF TG_OP = 'INSERT' OR OLD.parent_id IS NULL OR OLD.parent_id != NEW.parent_id THEN
  17. SELECT parent_path || id::text FROM section WHERE id = NEW.parent_id INTO path;
  18. IF path IS NULL THEN
  19. RAISE EXCEPTION 'Invalid parent_id %', NEW.parent_id;
  20. END IF;
  21. NEW.parent_path = path;
  22. END IF;
  23. RETURN NEW;
  24. END;
  25. $$ LANGUAGE plpgsql;
  26. CREATE TRIGGER parent_path_tgr
  27. BEFORE INSERT OR UPDATE ON section
  28. FOR EACH ROW EXECUTE PROCEDURE update_section_parent_path();

Much better.

More

Use json and plv8 to work with tree data

Written by Chris Farmiloe

Hierarchical data in postgres的更多相关文章

  1. asp.net Hierarchical Data

    Introduction A Hierarchical Data is a data that is organized in a tree-like structure and structure ...

  2. mysql 树形数据,层级数据Managing Hierarchical Data in MySQL

    原文:http://mikehillyer.com/articles/managing-hierarchical-data-in-mysql/ 引言 大多数用户都曾在数据库中处理过分层数据(hiera ...

  3. Managing Hierarchical Data in MySQL

    Managing Hierarchical Data in MySQL Introduction Most users at one time or another have dealt with h ...

  4. Managing Hierarchical Data in MySQL(邻接表模型)[转载]

    原文在:http://dev.mysql.com/tech-resources/articles/hierarchical-data.html 来源: http://www.cnblogs.com/p ...

  5. 云原生 PostgreSQL 集群 - PGO:来自 Crunchy Data 的 Postgres Operator

    使用 PGO 在 Kubernetes 上运行 Cloud Native PostgreSQL:来自 Crunchy Data 的 Postgres Operator! Cloud Native Po ...

  6. [Postgres] Group and Aggregate Data in Postgres

    How can we see a histogram of movies on IMDB with a particular rating? Or how much movies grossed at ...

  7. 《利用Python进行数据分析: Python for Data Analysis 》学习随笔

    NoteBook of <Data Analysis with Python> 3.IPython基础 Tab自动补齐 变量名 变量方法 路径 解释 ?解释, ??显示函数源码 ?搜索命名 ...

  8. Following a Select Statement Through Postgres Internals

    This is the third of a series of posts based on a presentation I did at the Barcelona Ruby Conferenc ...

  9. ZOJ 3826 Hierarchical Notation 模拟

    模拟: 语法的分析 hash一切Key建设规划,对于记录在几个地点的每个节点原始的字符串开始输出. . .. 对每一个询问沿图走就能够了. .. . Hierarchical Notation Tim ...

随机推荐

  1. day03_12 缩进介绍

    python比较变态,必须缩进,而C可以不缩进,世界上只有python这门语言要求必须缩进 tab键是缩进,shift+tab往左移动 SyntaxError:invalid syntax 语法错误 ...

  2. df和du显示的磁盘空间使用情况不一致问题

    背景介绍: dba同事删除了mysql /datao目录下的文件,通过du –sh查看空间使用700G,df -h查看空间使用1T,没有重启mysql服务. 另一个表现出du与df命令不同之处的例子如 ...

  3. 聊聊、Highcharts 动态数据优化版

    好久没来博客园了,最近项目太紧.上一篇写了 <Highcharts 之[动态数据]>,不够完善,虽然横纵轴可以动态取数据,但是行业信息是固定的,不能随着大数据热点改变.废话不说,直接上代码 ...

  4. Oracle实例和数据库区别

          什么是数据库,其实很简单,数据库就是存储数据的一种媒介.比如常用的文件就是一种,在Oracle10G中,数据的存储有好几种.第一种是文件形式,也就是在你的磁盘中创建一批文件,然后在这些文件 ...

  5. DS博客作业05--树

    1.本周学习总结 1.思维导图 2.谈谈你对树结构的认识及学习体会 学完树之后,最大的感觉就是在处理节点之间的兄弟父亲关系的时候真的挺好用的,一目了然.不过,树令人比较头疼的就是要用递归,大致能懂递归 ...

  6. not exists、left join/is null、not in 行为

    测试数据 20:25:52[test](;)> select * from t;+------+------+| id   | b    |+------+------+|    1 | NUL ...

  7. 【bzoj1907】树的路径覆盖 树形dp

    题目描述 输入 输出 样例输入 1 7 1 2 2 3 2 4 4 6 5 6 6 7 样例输出 3 题解 树形dp 设f[x]表示以x为根的子树完成路径覆盖,且x为某条路径的一端(可以向上延伸)的最 ...

  8. 用JS实现倒计时(日期字符串作为参数)

    <!doctype html> <html> <head> <meta charset="utf-8"> <title> ...

  9. jsp生成war

    安装java 环境,进入jsp所在目录,使用如下命令可将当前目录中所有文件打成到xss.war包中,正常的war包中还包含另外两个文件meta-inf,web-inf,在生成的时候,需要把这两个文件加 ...

  10. eclipse testng插件安装

    1.安装Testng 在Eclipse中,点击Help→Install new software→点击Add,在location中输入http://beust.com/eclipse ,选择TestN ...