拾墨、 - 博客园 (cnblogs)

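August 9, 2024

Summary: Linking is the process of collecting and combining various pieces of code and data into a single file that can be loaded (copied) into memory and executed. 1. Compiler drivers: a look at the whole linking process. The driver runs the C preprocessor (cpp), which translates main.c into an ASCII intermediate file, main.i. The driver then runs the C compiler (cc1), which translates main.i into an ASC… Read more

posted @ 2024-08-09 16:13 拾墨、 Views(45) Comments(0) Recommended(0)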

July 17, 2024

Building MySQL 8.0.37 from source on openEuler

Summary: Partly based on the blog post https://blog.csdn.net/ghpanxt/article/details/119387253 1. Install dependencies: yum install -y openssl-devel ncurses-devel libaio libaio-devel libtirpc-devel op… Read more

posted @ 2024-07-17 02:15 拾墨、 Views(268) Comments(0) Recommended(0)

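July 10, 2024

How to fork all branches of a repository with git

Summary: Suppose the repository to fork is named a and your local repository is named b. Clone repository a over SSH: git clone git@github.com:username/a.git cd a Add repository b as the upstream remote: git remote add upstream git@github.com:usernam… Read more

posted @ 2024-07-10 08:31 拾墨、 Views(530) Comments(1) Recommended(1)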

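June 22, 2024

1. hello rvos

Summary: Use the QEMU emulator to simulate a board, then print hello on the host over the emulated serial line. In QEMU, the memory map of the UART registers is shown in the figure (physical addresses). 1. Source code walkthrough 1. platform.h defines board-specific items #ifndef __PLATFORM_H__ #define __PLATFORM_… Read more

posted @ 2024-06-22 22:36 拾墨、 Views(63) Comments(0) Recommended(0)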

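June 10, 2024

Function calls in RISC-V

Summary: First, look at the complete execution of an ordinary main function (using the a+b problem as an example) int main() { int a = 2; int b = 3; int c = a + b; } Its RISC-V (rv32) assembly is as follows main: addi sp,sp,-32 # move the stack pointer sp down by 32 bytes, reserving stack… Read more

posted @ 2024-06-10 21:42 拾墨、 Views(619) Comments(0) Recommended(0)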

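March 31, 2024

CUDA streams

Summary: As shown in the figure below, CUDA streams let multiple tasks that execute the same kernel run concurrently, which improves efficiency. This is much like a CPU pipeline. Suppose we want the kernel below to run twice, each time with different arguments; we need CUDA streams to run the two launches concurrently and improve efficiency. __global__ void kernel( int *a, int *b, int *c ) { int… Read more

posted @ 2024-03-31 11:11 拾墨、 Views(61) Comments(0) Recommended(0)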

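March 28, 2024

CUDA atomic operations

Summary: Without atomic operations, computing a histogram runs into conflicting updates. d_b[i] is the number of times the value i occurs in h_a. The code below sets every element of h_a to 3, yet d_b[3] ends up as 1 #include<iostream> #include "cuda_runtime.h" #include "device_launch_parameters. … Read more

posted @ 2024-03-28 16:10 拾墨、 Views(44) Comments(0) Recommended(0)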

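March 23, 2024

Computing π by numerical integration with MPI

Summary: The example in this post comes from https://zhuanlan.zhihu.com/p/399150417 The approach follows the case in my earlier post https://www.cnblogs.com/algoshimo/p/18063325 where there are not enough threads # include "mpi.h" # include<stdi… Read more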
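
The work-splitting idea mentioned in the summary above (fewer workers than work items, so each worker strides through the items) can be sketched in plain C without MPI. This is only an illustration under stated assumptions: the integrand 4/(1+x^2) on [0, 1] with the midpoint rule is assumed, since the truncated summary does not show it; the function names `pi_partial_sum` and `pi_estimate` are hypothetical; and the loop over ranks in `pi_estimate` merely stands in for the step where an MPI program would combine the partial sums across processes.

```c
#include <assert.h>

/* Partial sum for one simulated rank.
 * Assumption: midpoint rule on f(x) = 4 / (1 + x^2) over [0, 1].
 * The rank handles intervals rank, rank + size, rank + 2*size, ...
 * so the n intervals are covered even when size is much smaller than n. */
double pi_partial_sum(int rank, int size, long n)
{
    assert(size > 0 && rank >= 0 && rank < size && n > 0);
    double h = 1.0 / (double)n;   /* width of one interval */
    double sum = 0.0;
    for (long i = rank; i < n; i += size) {
        double x = h * ((double)i + 0.5);   /* midpoint of interval i */
        sum += 4.0 / (1.0 + x * x);
    }
    return h * sum;
}

/* Serial stand-in for the combine step: add every rank's partial sum. */
double pi_estimate(int size, long n)
{
    double total = 0.0;
    for (int rank = 0; rank < size; rank++) {
        total += pi_partial_sum(rank, size, n);
    }
    return total;
}
```

Calling `pi_estimate(4, 1000000)` simulates four ranks sharing one million intervals; the interval count does not need to be a multiple of the rank count.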

posted @ 2024-03-23 22:02 拾墨、 Views(94) Comments(0) Recommended(0)

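March 21, 2024

The CUDA memory model

Summary: The CUDA memory model can essentially be summed up by the three figures below. A double-headed arrow means readable and writable; a single-headed arrow means read-only. 1. local memory #include<iostream> #include "cuda_runtime.h" #include "device_launch_parameters.h" # define BL… Read more

posted @ 2024-03-21 20:26 拾墨、 Views(154) Comments(0) Recommended(0)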

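March 20, 2024

CUDA reduction

Summary: In the Chinese term for reduction (规约), 归 stands for recursion and 约 for shrinking. Consider summing the elements of a matrix. Summing serially takes O(n) time, but adding pairs in parallel as in the figure below takes only O(log n) steps. __global__ void reduceNeighbored(int* g_idata, int* g_odata, unsigned int n) //g… Read more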
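
The pairwise scheme described in the summary above can be sketched on the CPU in plain C. This is not the post's `reduceNeighbored` kernel, which is truncated in the listing. It is a serial illustration in which each pass of the outer loop plays the role of one parallel step, because all additions within a pass touch disjoint elements. The names `reduce_neighbored_host` and `reduction_steps` are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Sums data[0..n) in place by neighbored pairwise reduction.
 * Pass k adds elements that are stride = 2^k apart; on a GPU every
 * addition inside one pass could run at the same time.
 * The total ends up in data[0]; the array is overwritten. */
int reduce_neighbored_host(int *data, size_t n)
{
    assert(data != NULL && n > 0);
    for (size_t stride = 1; stride < n; stride *= 2) {
        for (size_t i = 0; i + stride < n; i += 2 * stride) {
            data[i] += data[i + stride];
        }
    }
    return data[0];
}

/* Number of passes the outer loop makes, i.e. ceil(log2(n)). */
unsigned reduction_steps(size_t n)
{
    unsigned steps = 0;
    for (size_t stride = 1; stride < n; stride *= 2) {
        steps++;
    }
    return steps;
}
```

For eight elements the outer loop runs three times (strides 1, 2, 4), compared with seven sequential additions in a plain running sum. The pass count grows logarithmically, while the total number of additions stays n - 1.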

posted @ 2024-03-20 15:25 拾墨、 Views(99) Comments(0) Recommended(0)

algoshimo

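青青子衿，悠悠我心 但为君故，沉吟至今 (Blue, blue is your collar; long, long is my longing. Because of you, I have been lost in thought until now.)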