一起来了解hashmap核心机制

HashMap 是 Java 中常用的集合类，用于存储键值对（key-value）。理解其核心机制需要深入源码，了解其内部结构、哈希算法、冲突处理、扩容机制等。以下是对 Java 8 及以后版本中 HashMap 核心机制的详细讲解。

1. HashMap 的基本结构

在 Java 8 中，HashMap 主要由以下几个核心部分组成：

数组（table）：底层数据结构是一个数组，数组中的每个元素称为“桶”（bucket），用于存储键值对。
链表和红黑树：当发生哈希冲突时，链表用于存储多个键值对。为了提高性能，当链表长度超过一定阈值（默认为 8）时，链表将转换为红黑树，以降低查找时间复杂度。
Node 类：内部类，用于表示每个键值对，继承自 Map.Entry<K, V>，并包含键、值、哈希值以及指向下一个节点的引用。

static class Node<K,V> implements Map.Entry<K,V> {
    final int hash;
    final K key;
    V value;
    Node<K,V> next;

    // 构造方法和其他方法
}

2. 哈希算法与索引计算

HashMap 通过哈希函数将键映射到数组中的索引位置。Java 8 中使用的哈希函数经过优化，以减少哈希冲突。

哈希函数

在 Java 8 中，HashMap 的 hash(Object key) 方法用于计算键的哈希值：

static final int hash(Object key) {
    int h;
    return (key == null) ? 0 : (h = key.hashCode()) ^ (h >>> 16);
}

这里，h = key.hashCode() 获取键的原始哈希码，然后通过 h ^ (h >>> 16) 进行扰动，以减少不同对象可能产生相同高位的情况，从而降低哈希冲突几率。

索引计算

通过 indexFor 方法计算键值对在数组中的索引位置：

static int indexFor(int hash, int length) {
    return hash & (length - 1);
}

这里使用位运算 hash & (length - 1)，要求数组长度 length 必须是 2 的幂，以确保索引分布均匀。

3. 插入（put）操作

插入操作包括以下步骤：

计算哈希值和索引：通过哈希函数计算键的哈希值，然后计算索引位置。
处理冲突：
- 如果桶为空，直接插入新的节点。
- 如果桶不为空，遍历链表或树，检查是否有相同键存在。
  - 如果存在，覆盖旧值。
  - 如果不存在，添加新的节点。如果链表长度超过阈值（8），转换为红黑树。
扩容：如果当前元素数量超过阈值（负载因子 0.75 * 数组长度），则触发扩容，将数组长度翻倍，并重新分配所有节点。

代码示例

以下是 putVal 方法的核心部分：

final V putVal(int hash, K key, V value, boolean onlyIfAbsent, boolean evict) {
    Node<K,V>[] tab; Node<K,V> p; int n, i;
    if ((tab = table) == null || (n = tab.length) == 0)
        n = (tab = resize()).length;
    if ((p = tab[i = (n - 1) & hash]) == null)
        tab[i] = newNode(hash, key, value, null);
    else {
        Node<K,V> e; K k;
        if (p.hash == hash && ((k = p.key) == key || (key != null && key.equals(k))))
            e = p;
        else if (p instanceof TreeNode)
            e = ((TreeNode<K,V>) p).putTreeVal(this, tab, hash, key, value);
        else {
            for (int binCount = 0; ; ++binCount) {
                if ((e = p.next) == null) {
                    p.next = newNode(hash, key, value, null);
                    if (binCount >= TREEIFY_THRESHOLD - 1)
                        treeifyBin(tab, hash);
                    break;
                }
                if (e.hash == hash && ((k = e.key) == key || (key != null && key.equals(k))))
                    break;
                p = e;
            }
        }
        if (e != null) { // existing key
            V oldValue = e.value;
            if (!onlyIfAbsent || oldValue == null)
                e.value = value;
            afterNodeAccess(e);
            return oldValue;
        }
    }
    ++modCount;
    if (++size > threshold)
        resize();
    afterNodeInsertion(evict);
    return null;
}

4. 冲突处理与链表转红黑树

当多个键映射到同一个索引位置时，会在该桶中形成链表。当链表长度超过 8 时，HashMap 会将链表转换为红黑树，以提高查找效率。

转换为红黑树

treeifyBin 方法负责将链表转换为红黑树：

final void treeifyBin(Node<K,V>[] tab, int hash) {
    int n, index; Node<K,V> e;
    if (tab == null || (n = tab.length) < MIN_TREEIFY_CAPACITY)
        resize();
    else if ((e = tab[index = (n - 1) & hash]) != null) {
        TreeNode<K,V> hd = null, tl = null;
        do {
            TreeNode<K,V> p = replacementTreeNode(e, null);
            if (tl == null)
                hd = p;
            else {
                p.prev = tl;
                tl.next = p;
            }
            tl = p;
        } while ((e = e.next) != null);
        if (hd != null)
            tab[index] = hd;
        if (hd != null)
            hd.treeify(tab);
    }
}

红黑树节点

TreeNode 是 HashMap 的内部类，扩展了 LinkedHashMap.Entry，用于实现红黑树的节点结构，并包含额外的指针用于树的平衡操作。

5. 查找（get）操作

查找操作主要通过计算键的哈希值和索引，然后在相应的链表或红黑树中查找。

代码示例

getNode 方法用于查找节点：

final Node<K,V> getNode(int hash, Object key) {
    Node<K,V>[] tab; Node<K,V> first, e; int n; K k;
    if ((tab = table) != null && (n = tab.length) > 0 &&
        (first = tab[(n - 1) & hash]) != null) {
        if (first.hash == hash && ((k = first.key) == key || (key != null && key.equals(k))))
            return first;
        if (first instanceof TreeNode)
            return ((TreeNode<K,V>) first).getTreeNode(hash, key);
        for (e = first.next; e != null; e = e.next) {
            if (e.hash == hash &&
                ((k = e.key) == key || (key != null && key.equals(k))))
                return e;
        }
    }
    return null;
}

如果桶中的第一个节点匹配，则直接返回；如果是红黑树，则调用 getTreeNode 在树中查找；否则，遍历链表查找。

6. 扩容（resize）机制

当 HashMap 中的元素数量超过阈值（默认 0.75 * 数组长度）时，会触发扩容，将数组长度翻倍，并重新分配所有节点。

代码示例

resize 方法的核心逻辑：

final Node<K,V>[] resize() {
    Node<K,V>[] oldTab = table;
    int oldCap = (oldTab == null) ? 0 : oldTab.length;
    int oldThr = threshold;
    int newCap, newThr = 0;
    if (oldCap > 0) {
        if (oldCap >= MAXIMUM_CAPACITY) {
            threshold = Integer.MAX_VALUE;
            return oldTab;
        }
        newCap = oldCap << 1;
        if (newCap < MAXIMUM_CAPACITY && oldCap >= DEFAULT_INITIAL_CAPACITY)
            newThr = oldThr << 1;
    }
    else if (oldThr > 0) {
        newCap = oldThr;
    }
    else {
        newCap = DEFAULT_INITIAL_CAPACITY;
        newThr = (int)(DEFAULT_LOAD_FACTOR * DEFAULT_INITIAL_CAPACITY);
    }
    if (newThr == 0) {
        newThr = (int)(newCap * loadFactor);
    }
    threshold = newThr;
    @SuppressWarnings({"rawtypes","unchecked"})
    Node<K,V>[] newTab = (Node<K,V>[])new Node[newCap];
    table = newTab;
    if (oldTab != null) {
        for (int j = 0; j < oldCap; ++j) {
            Node<K,V> e;
            if ((e = oldTab[j]) != null) {
                oldTab[j] = null;
                if (e.next == null)
                    newTab[e.hash & (newCap - 1)] = e;
                else if (e instanceof TreeNode)
                    ((TreeNode<K,V>) e).split(this, newTab, j, oldCap);
                else { // 批量转移链表节点
                    Node<K,V> loHead = null, loTail = null;
                    Node<K,V> hiHead = null, hiTail = null;
                    Node<K,V> next;
                    do {
                        next = e.next;
                        if ((e.hash & oldCap) == 0) {
                            if (loTail == null)
                                loHead = e;
                            else
                                loTail.next = e;
                            loTail = e;
                        }
                        else {
                            if (hiTail == null)
                                hiHead = e;
                            else
                                hiTail.next = e;
                            hiTail = e;
                        }
                    } while ((e = next) != null);
                    if (loTail != null) {
                        loTail.next = null;
                        newTab[j] = loHead;
                    }
                    if (hiTail != null) {
                        hiTail.next = null;
                        newTab[j + oldCap] = hiHead;
                    }
                }
            }
        }
    }
    return newTab;
}

重新分配节点

扩容后，HashMap 需要将旧数组中的节点重新分配到新数组中。对于每个节点，根据新的数组长度重新计算索引位置。

链表节点：通过位运算将链表节点分配到新数组中。
红黑树节点：调用 split 方法，将树节点分配到新数组中。

7. 删除（remove）操作

删除操作类似于查找，首先定位到节点，然后调整链表或红黑树结构以移除目标节点。

代码示例

removeNode 方法用于删除节点：

final V removeNode(int hash, Object key, Object value, boolean matchValue, boolean movable) {
    Node<K,V>[] tab; Node<K,V> p; int n, index;
    if ((tab = table) != null && (n = tab.length) > 0 &&
        (p = tab[index = (n - 1) & hash]) != null) {
        Node<K,V> node = null, e; K k;
        if (p.hash == hash &&
            ((k = p.key) == key || (key != null && key.equals(k))))
            node = p;
        else if (p instanceof TreeNode)
            node = ((TreeNode<K,V>) p).getTreeNode(hash, key);
        else {
            for (int i = 0; i < BUCKET_SIZE_LIMIT && (e = p.next) != null; i++) {
                if (e.hash == hash &&
                    ((k = e.key) == key || (key != null && key.equals(k)))) {
                    node = e;
                    break;
                }
                p = e;
            }
        }
        if (node != null && (!matchValue || Objects.equals(node.value, value))) {
            if (node instanceof TreeNode)
                ((TreeNode<K,V>) node).removeTreeNode(this, tab, movable);
            else if (node == p)
                tab[index] = node.next;
            else
                p.next = node.next;
            ++modCount;
            --size;
            afterNodeRemoval(node);
            return node.value;
        }
    }
    return null;
}

8. 其他关键方法

遍历：HashMap 提供多种遍历方式，如 forEach、keySet、entrySet 等，底层通过迭代器实现。
迭代器：HashMap 的迭代器实现了 Iterator 接口，通过链表或红黑树遍历节点。

9. 性能优化

Java 8 对 HashMap 进行了多项优化：

树化：当链表长度超过阈值时，转换为红黑树，提高查找性能。
扰动函数：优化哈希函数，减少哈希冲突。
延迟初始化：默认情况下，HashMap 的数组在首次插入时才会初始化，节省内存。
并发优化：虽然 HashMap 不是线程安全的，但其内部结构设计便于扩展为线程安全的 ConcurrentHashMap。

posted @ 2024-11-20 15:26 DaLiuc 阅读(12) 评论(0) 编辑收藏举报

刷新页面返回顶部

一起来了解hashmap核心机制

1. HashMap 的基本结构

2. 哈希算法与索引计算

哈希函数

索引计算

3. 插入（put）操作

代码示例

4. 冲突处理与链表转红黑树

转换为红黑树

红黑树节点

5. 查找（get）操作

代码示例

6. 扩容（resize）机制

代码示例

重新分配节点

7. 删除（remove）操作

代码示例

8. 其他关键方法

9. 性能优化

公告