LXR | KVM | PM | Time | Interrupt | Systems Performance | Bootup Optimization

ksar、sar及相关内核知识点解析

关键词:sar、sadc、ksar、/proc/stat、/proc/cpuinfo、/proc/meminfo、/proc/diskstats。

在之前有简单介绍过sar/ksar,最近在使用中感觉需要再深入了解一下。

ksar/sar从内核采集数据,并输出可读性数据。分析相关源码,有助于知道数据来龙去脉。--------------------------------------1. sar源码概览

ksar显示sar/sadc获取的数据,并图形化显示。数据从内核节点,到sadc/sar转换,再到ksar显示。----------------------------2. ksar处理流程

对照ksar每张图表,然后sar/sadc对应的采集转换,再到内核每个数据项含义解析。--------------------------------------------3. ksar解读

期望是从ksar上的图表能对应到内核的代码,明白这些图表数据根源。

1. sar源码概览

sar作为sysstat一部分,相关的工具还包括sadc、sa1、sa2。

sa1负责收集并存储每天系统动态信息到一个二进制的文件中,sa1是sadc所涉及的程序前端程序。通常由计划任务工具cron来调用。打开sa1文件不难看出就是调用sadc今次那个采集数据。

sa2就是调用sar命令,将当日二进制日志文件数据存储到文本文件中。

sadc是系统动态数据收集工具,收集的数据被写入一个二进制文件中,它是sar工具后端。

sar负责解析sadc保存的数据,并显示出来。

当使用sar进行数据统计的时候,通过pstree `pidof sar`,可以看出sar调用了sadc。

sar───sadc

1.1 sadc信息采样

sadc入口在sadc.c中,主要是解析参数、启动一个interval alarm、rw_sa_stat_loop()读取数据。

通过alarm触发SIGALRM实现周期性读取,SIGINT停止读取。

int main(int argc, char **argv)
{
    int opt = 0;
    char ofile[MAX_FILE_LEN], sa_dir[MAX_FILE_LEN];
    int stdfd = 0, ofd = -1;
    int restart_mark;
    long count = 0;

    /* Get HZ */
    get_HZ();

    /* Compute page shift in kB */
    get_kb_shift();

    ofile[0] = sa_dir[0] = comment[0] = '\0';

...-----------------------------------------------------------解析参数并进行配置。

    /* Set a handler for SIGALRM */
    memset(&alrm_act, 0, sizeof(alrm_act));
    alrm_act.sa_handler = alarm_handler;
    sigaction(SIGALRM, &alrm_act, NULL);
    alarm(interval);------------------------------------------启动一个alarm,超时未interval,并通过在alarm_handler()中重新起一个alarm实现周期性处理。

    /* Main loop */
    rw_sa_stat_loop(count, stdfd, ofd, ofile, sa_dir);

#ifdef HAVE_SENSORS
    /* Cleanup sensors */
    sensors_cleanup();
#endif /* HAVE_SENSORS */

    /* Free structures */
    sa_sys_free();

    return 0;
}

rw_sa_stat_loop()是整个sadc的核心循环,这里从个sysfs读取信息,并提取关键信息,然后保存。

void rw_sa_stat_loop(long count, int stdfd, int ofd, char ofile[],
             char sa_dir[])
{
    int do_sa_rotat = 0;
    unsigned int save_flags;
    char new_ofile[MAX_FILE_LEN] = "";
    struct tm rectime = {0, 0, 0, 0, 0, 0, 0, 0, 0, 0, NULL};

    /* Set a handler for SIGINT */
    memset(&int_act, 0, sizeof(int_act));
    int_act.sa_handler = int_handler;
    sigaction(SIGINT, &int_act, NULL);
/* Main loop */
    do {

        reset_stats();
...
        /* Read then write stats */
        read_stats();--------------------------------------------------------遍历act[]中所有的struct activity,进行采样。

        if (stdfd >= 0) {
            save_flags = flags;
            flags &= ~S_F_LOCK_FILE;
            write_stats(stdfd);----------------------------------------------通过标准输出文件打印信息。
            flags = save_flags;
        }

        /* If the record type was R_LAST_STATS, tag it R_STATS before writing it */
        record_hdr.record_type = R_STATS;
        if (ofile[0]) {
            write_stats(ofd);------------------------------------------------将结果写到指定文件中。
        }
...
        fflush(stdout);

        if (count > 0) {
            count--;
        }

        if (count) {
            /* Wait for a signal (probably SIGALRM or SIGINT) */
            pause();----------------------------------------------------------此处和alarm()配合达到周期性采样的效果。
        }

        if (sigint_caught)----------------------------------------------------如果收到SIGINT信号,提前终止采样。
            /* SIGINT caught: Stop now */
            break;
...
    }
    while (count);------------------------------------------------------------达到总采样数,同样停止采样。

    /* Close file descriptors if they have actually been used */
    CLOSE(stdfd);
    CLOSE(ofd);
}

read_stats()是核心采集数据函数,核心数据结构式act[]。

void read_stats(void)
{
    int i;
    __nr_t cpu_nr = act[get_activity_position(act, A_CPU, EXIT_IF_NOT_FOUND)]->nr;

    record_hdr.uptime0 = 0;
    if (cpu_nr > 2) {
        read_uptime(&(record_hdr.uptime0));
    }

    for (i = 0; i < NR_ACT; i++) {
        if (IS_COLLECTED(act[i]->options)) {---------------------------------------遍历所有act[],如果act[]中对应的options包含AO_COLLECTED则进行f_read()。
            /* Read statistics for current activity */            (*act[i]->f_read)(act[i]);
        }
    }

    if (cpu_nr == 1) {
        record_hdr.uptime0 = record_hdr.uptime;
    }
}

act[]存放了所有统计事件的struct activity。

struct activity *act[NR_ACT] = {
    &cpu_act,
    &pcsw_act,
    &irq_act,
    &swap_act,
    &paging_act,
    &io_act,
    &memory_act,
    &huge_act,
    &ktables_act,
    &queue_act,
    &serial_act,
    &disk_act,
    /* <network> */
    &net_dev_act,
    &net_edev_act,
    &net_nfs_act,
    &net_nfsd_act,
    &net_sock_act,
    &net_ip_act,
    &net_eip_act,
    &net_icmp_act,
    &net_eicmp_act,
    &net_tcp_act,
    &net_etcp_act,
    &net_udp_act,
    &net_sock6_act,
    &net_ip6_act,
    &net_eip6_act,
    &net_icmp6_act,
    &net_eicmp6_act,
    &net_udp6_act,
    &fchost_act,
    &softnet_act,    /* AO_CLOSE_MARKUP */
    /* </network> */
    /* <power-management> */
    &pwr_cpufreq_act,
    &pwr_fan_act,
    &pwr_temp_act,
    &pwr_in_act,
    &pwr_wghfreq_act,
    &pwr_usb_act,        /* AO_CLOSE_MARKUP */
    /* </power-management> */
    &filesystem_act
};

 

1.2 sar显示统计信息

read_sadc_stat_bunch()读取统计信息,write_stats()调用每个struct activity的f_print()函数,write_stats_avg()调用每个struct activity的f_print_avg()函数。

f_print()和f_print_avg()或从文件中解析字符串,或启动sadc采样,然后再解析。

2 ksar处理流程

2.1 ksar介绍

ksar资源:ksar安装文件

ksar中看到的图标是结果,这些数据是通过sadc采集,sar解析出来的。

sadc是通过读取sysfs/procfs节点来获取信息,这些节点都是内核提供的统计信息。

sar -o temp.bin 1 600--------------------------------------------------sar将采样数据保存在temp.bin中。

LC_ALL=C sar -A -f temp.bin > sar.txt----------------------------将保存的采样数据temp.bin,转换成更可读性强的文本文件。

所以ksar的每一张图标,都对应了内核统计信息。

sar是数据搬运整理工具,ksar是图形化工具。

下面对每张图标从ksar,到sar/sadc,最终到内核中每个数据。

 

2.2 ksar操作

2.2.1 数据导入

通过Data->Append from a file...从txt中加载数据,还有其他两种数据来源方式。

2.2.2 数据导出

如果要将图表导出,可以在每张图标下面选择Export PNG之类。

或者通过Export->Export to PDF...,选择指定选项。

3. ksar解析

下面结合ksar图表来逐项解析。

3.1 CPU信息

cpu_act读取/proc/stat节点,解析其中cpu信息,包括合计cpu信息以及单个cpu信息。

/proc/stat中的信息包括一个合计以及多个cpu单独统计信息。

从/proc/stat中读取的信息都是从启动以来的累计时间,在图标中显示的是一个时间段的差值。

然后计算不同模块耗时占比。

void read_stat_cpu(struct stats_cpu *st_cpu, int nbr,
           unsigned long long *uptime, unsigned long long *uptime0)
{
    FILE *fp;
    struct stats_cpu *st_cpu_i;
    struct stats_cpu sc;
    char line[8192];
    int proc_nb;

    if ((fp = fopen(STAT, "r")) == NULL) {----------------------------------------------------/proc/stat
        fprintf(stderr, _("Cannot open %s: %s\n"), STAT, strerror(errno));
        exit(2);
    }

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "cpu ", 4)) {------------------------------------------------------统计总cpu不同类别耗时,详细信息参考/proc/stat。这里的信息和内核中一一对应。
            memset(st_cpu, 0, STATS_CPU_SIZE);
            sscanf(line + 5, "%llu %llu %llu %llu %llu %llu %llu %llu %llu %llu",
                   &st_cpu->cpu_user,
                   &st_cpu->cpu_nice,
                   &st_cpu->cpu_sys,
                   &st_cpu->cpu_idle,
                   &st_cpu->cpu_iowait,
                   &st_cpu->cpu_hardirq,
                   &st_cpu->cpu_softirq,
                   &st_cpu->cpu_steal,
                   &st_cpu->cpu_guest,
                   &st_cpu->cpu_guest_nice);

            *uptime = st_cpu->cpu_user + st_cpu->cpu_nice    +
                st_cpu->cpu_sys    + st_cpu->cpu_idle    +
                st_cpu->cpu_iowait + st_cpu->cpu_hardirq +
                st_cpu->cpu_steal  + st_cpu->cpu_softirq;
        }

        else if (!strncmp(line, "cpu", 3)) {
...
    }

    fclose(fp);
}

3.2 进程创建及切换

pcsw_act读取/proc/stat节点,解析其中的ctxt和processes信息。

同样的/proc/stat中,统计信息是启动以来的累计值,图标中现实的不同时间段的差值。

proc/s表示每秒创建的进程数目,cswch/s表示每秒进程切换次数。

void read_stat_pcsw(struct stats_pcsw *st_pcsw)
{
    FILE *fp;
    char line[8192];

    if ((fp = fopen(STAT, "r")) == NULL)
        return;

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "ctxt ", 5)) {--------------------------------------------ctxt是所有CPU的进程切换次数。
            /* Read number of context switches */
            sscanf(line + 5, "%llu", &st_pcsw->context_switch);
        }

        else if (!strncmp(line, "processes ", 10)) {---------------------------------是整个系统创建进程的次数,total_forks,
            /* Read number of processes created since system boot */
            sscanf(line + 10, "%lu", &st_pcsw->processes);
        }
    }

    fclose(fp);
}

3.3 swap信息

swap_act读取/proc/vmstat节点,解析其中的pswpin和pswpout两个信息。

pswpin/s表示系统每秒从swap分区读入的页面数量,即移除掉swap;pswpout/s表示系统每秒写到swap分区的页面数量,即产生swap。

void read_vmstat_swap(struct stats_swap *st_swap)
{
    FILE *fp;
    char line[128];

    if ((fp = fopen(VMSTAT, "r")) == NULL)
        return;

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "pswpin ", 7)) {-------------------------------对应PSWPIN,表示从swap分区读入页面。
            /* Read number of swap pages brought in */
            sscanf(line + 7, "%lu", &st_swap->pswpin);
        }
        else if (!strncmp(line, "pswpout ", 8)) {-------------------------对应PSWPOUT,表示将page写入到swap分区。
            /* Read number of swap pages brought out */
            sscanf(line + 8, "%lu", &st_swap->pswpout);
        }
    }

    fclose(fp);
}

3.4 页面统计信息

paging_act统计/proc/vmstat中页面交换统计信息。

Paging包括四张图标,分别统计/proc/vmstat中的PGPGIN、PGPGOUT、PGFAULT、PGMAJFAULT等等信息。

pgpgin/s表示系统每秒从磁盘中paged多少KB;pgpgout/s则表示到磁盘中多少KB。

fault/s表示系统每秒产生的页面异常数目,包括major和minor;majflt/s则表示页面异常是从磁盘中产生的。

pgfree/s表示每秒放入到free list的页面数量;pgscank/s、pgscand/s都表示扫描的页面数量,只不过前者表示kswapd扫描结果,后者表示直接扫描;pgsteal/s都表示从pagecache和swapcache中回收的页面数量。

%vmeff是pgsteal/pgscan表示页面回收的效率。

看fault/s和pgfree/s存在一定关系,前者表示内存申请,后者表示内存释放,存在先后关系。但是单位不一致,前者是次数,后者是页面。

void read_vmstat_paging(struct stats_paging *st_paging)
{
    FILE *fp;
    char line[128];
    unsigned long pgtmp;

    if ((fp = fopen(VMSTAT, "r")) == NULL)
        return;

    st_paging->pgsteal = 0;
    st_paging->pgscan_kswapd = st_paging->pgscan_direct = 0;

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "pgpgin ", 7)) {------------------------------------------------pgpgin和pgpgout分别对应PGPGIN和PGPGOUT,在submit_io()中更新。指内存和块设备志坚page数目,这里的page指的是disk sector。pgpgin表示从块设备读入,pgpgout写入到跨设备。
            /* Read number of pages the system paged in */
            sscanf(line + 7, "%lu", &st_paging->pgpgin);
        }
        else if (!strncmp(line, "pgpgout ", 8)) {
            /* Read number of pages the system paged out */
            sscanf(line + 8, "%lu", &st_paging->pgpgout);
        }
        else if (!strncmp(line, "pgfault ", 8)) {-------------------------------------------在vm_event_item[]中对应PGFAULT,在handle_mm_fault()中更新。统计产生page fault的信息。
            /* Read number of faults (major+minor) made by the system */
            sscanf(line + 8, "%lu", &st_paging->pgfault);
        }
        else if (!strncmp(line, "pgmajfault ", 11)) {---------------------------------------对应PGMAJFAULT,表示从磁盘而不是从内存中获取数据。
            /* Read number of faults (major only) made by the system */
            sscanf(line + 11, "%lu", &st_paging->pgmajfault);
        }
        else if (!strncmp(line, "pgfree ", 7)) {--------------------------------------------对应PGFREE,统计释放的页面次数。
            /* Read number of pages freed by the system */
            sscanf(line + 7, "%lu", &st_paging->pgfree);
        }
        else if (!strncmp(line, "pgsteal_", 8)) {-------------------------------------------统计PGSTEAL_KSWAPD和PGSTEAL_DIRECT信息,表示系统回收的kswapd和直接回收的页面数目。
            /* Read number of pages stolen by the system */
            sscanf(strchr(line, ' '), "%lu", &pgtmp);
            st_paging->pgsteal += pgtmp;
        }
        else if (!strncmp(line, "pgscan_kswapd", 13)) {--------------------------------------统计PGSCAN_KSWAPD信息,表示从系统启动到现在kswapd后台进程扫描的页面数。
            /* Read number of pages scanned by the kswapd daemon */
            sscanf(strchr(line, ' '), "%lu", &pgtmp);
            st_paging->pgscan_kswapd += pgtmp;
        }
        else if (!strncmp(line, "pgscan_direct", 13)) {--------------------------------------统计PGSCAN_DIRECT和PGSCAN_DIRECT_THROTTLE,统计世界回收页面数。
            /* Read number of pages scanned directly */
            sscanf(strchr(line, ' '), "%lu", &pgtmp);
            st_paging->pgscan_direct += pgtmp;
        }
    }

    fclose(fp);
}

3.5 IO统计信息

io_act从/proc/diskstats读取IO统计信息。

上面的tps、rtps、wtps、bread/s、bwrtn/s分别对应采样结果dk_drive、dk_drive_rio、dk_drive_wio、dk_drive_rblk、dk_drive_wblk。

其中tps为rtps和wtps之和,表示完成读写次数;bread/s和bwrtn/s表示读、写扇区数目。

这5个参数都来源于diskstats_show()函数。

void read_diskstats_io(struct stats_io *st_io)
{
    FILE *fp;
    char line[256];
    char dev_name[MAX_NAME_LEN];
    unsigned int major, minor;
    unsigned long rd_ios, wr_ios, rd_sec, wr_sec;

    if ((fp = fopen(DISKSTATS, "r")) == NULL)
        return;

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (sscanf(line, "%u %u %s %lu %*u %lu %*u %lu %*u %lu",-----------scanf的%u之间的星号,表示跳过此输入。
               &major, &minor, dev_name,
               &rd_ios, &rd_sec, &wr_ios, &wr_sec) == 7) {

            if (is_device(dev_name, IGNORE_VIRTUAL_DEVICES)) {--------------虚拟设备没有/sys/block/<device>/device,此特性用以判断dev_name对应的设备是否是真实的块设备。
                st_io->dk_drive      += (unsigned long long) rd_ios + (unsigned long long) wr_ios;
                st_io->dk_drive_rio  += rd_ios;
                st_io->dk_drive_rblk += rd_sec;
                st_io->dk_drive_wio  += wr_ios;
                st_io->dk_drive_wblk += wr_sec;
            }
        }
    }
    fclose(fp);
}

/proc/diskstats来源于内核的diskstats_show()

static int diskstats_show(struct seq_file *seqf, void *v)
{
    struct gendisk *gp = v;
    struct disk_part_iter piter;
    struct hd_struct *hd;
    char buf[BDEVNAME_SIZE];
    int cpu;

    /*
    if (&disk_to_dev(gp)->kobj.entry == block_class.devices.next)
        seq_puts(seqf,    "major minor name"
                "     rio rmerge rsect ruse wio wmerge "
                "wsect wuse running use aveq"
                "\n\n");
    */

    disk_part_iter_init(&piter, gp, DISK_PITER_INCL_EMPTY_PART0);
    while ((hd = disk_part_iter_next(&piter))) {
        cpu = part_stat_lock();
        part_round_stats(cpu, hd);
        part_stat_unlock();
        seq_printf(seqf, "%4d %7d %s %lu %lu %lu "
               "%u %lu %lu %lu %u %u %u %u\n",
               MAJOR(part_devt(hd)), MINOR(part_devt(hd)),
               disk_name(gp, hd->partno, buf),
               part_stat_read(hd, ios[READ]),-------------------------------------------------成功完成读的次数。
               part_stat_read(hd, merges[READ]),----------------------------------------------合并读次数,为了效率可能会合并相邻的读和写。
               part_stat_read(hd, sectors[READ]),---------------------------------------------读扇区的次数。
               jiffies_to_msecs(part_stat_read(hd, ticks[READ])),-----------------------------读花的时间,这里是所有读操作所花费的毫秒数。
               part_stat_read(hd, ios[WRITE]),------------------------------------------------成功完成写的次数。
               part_stat_read(hd, merges[WRITE]),---------------------------------------------合并写次数。
               part_stat_read(hd, sectors[WRITE]),--------------------------------------------写扇区次数。
               jiffies_to_msecs(part_stat_read(hd, ticks[WRITE])),----------------------------写花的时间。
               part_in_flight(hd),
               jiffies_to_msecs(part_stat_read(hd, io_ticks)),
               jiffies_to_msecs(part_stat_read(hd, time_in_queue))
            );
    }
    disk_part_iter_exit(&piter);

    return 0;
}

3.6 内存及swap统计信息

memory_act统计/proc/meminfo的内存和swap信息。

kbmemfree是所有free内存大小,kbavail是扣除保留内存,加上部分pagecache和可回收内存;kbmemused是目前系统内存使用量,不包括内核使用量。

%memused是内存使用占总内存百分比。

kbcommit是当前场景系统需要使用到的内存量,实际上可能并没有申请这么多内存。这是因为分配的内存只有在使用到时,才会产生缺页异常。

这里的kbcommit和页面统计信息相呼应,此处kbcommit突然增加,产生了很多page fault。

从图表中的名称大概就能找到对应的/proc/meminfo中的字符项。

其中%memused和%commit查看stub_print_memory_stats(),%memused为(MemTotal-MemFree)/MemTotal。

%commit为Commited_AS/(MemTotal+SwapTotal)。

在/proc/meminfo中看到的CommitLimit可能是proc/sys/vm/overcommit_kbytes;或者按照当前可以用内存乘以overcommit_ratio这个比例得到。

在实际使用中,还需要结合overcommit_memory类型来看,参考overcommit_memoryovercommit_ratio

unsigned long vm_commit_limit(void)
{
    unsigned long allowed;

    if (sysctl_overcommit_kbytes)
        allowed = sysctl_overcommit_kbytes >> (PAGE_SHIFT - 10);
    else
        allowed = ((totalram_pages - hugetlb_total_pages())
               * sysctl_overcommit_ratio / 100);
    allowed += total_swap_pages;

    return allowed;
}

read_meminfo()从/proc/meminfo中解析数据。

void read_meminfo(struct stats_memory *st_memory)
{
    FILE *fp;
    char line[128];

    if ((fp = fopen(MEMINFO, "r")) == NULL)
        return;

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "MemTotal:", 9)) {
            /* Read the total amount of memory in kB */
            sscanf(line + 9, "%lu", &st_memory->tlmkb);
        }
        else if (!strncmp(line, "MemFree:", 8)) {
            /* Read the amount of free memory in kB */
            sscanf(line + 8, "%lu", &st_memory->frmkb);
        }
        else if (!strncmp(line, "MemAvailable:", 13)) {
            /* Read the amount of available memory in kB */
            sscanf(line + 13, "%lu", &st_memory->availablekb);
        }
        else if (!strncmp(line, "Buffers:", 8)) {
            /* Read the amount of buffered memory in kB */
            sscanf(line + 8, "%lu", &st_memory->bufkb);
        }
        else if (!strncmp(line, "Cached:", 7)) {
            /* Read the amount of cached memory in kB */
            sscanf(line + 7, "%lu", &st_memory->camkb);
        }
        else if (!strncmp(line, "SwapCached:", 11)) {
            /* Read the amount of cached swap in kB */
            sscanf(line + 11, "%lu", &st_memory->caskb);
        }
        else if (!strncmp(line, "Active:", 7)) {
            /* Read the amount of active memory in kB */
            sscanf(line + 7, "%lu", &st_memory->activekb);
        }
        else if (!strncmp(line, "Inactive:", 9)) {
            /* Read the amount of inactive memory in kB */
            sscanf(line + 9, "%lu", &st_memory->inactkb);
        }
        else if (!strncmp(line, "SwapTotal:", 10)) {
            /* Read the total amount of swap memory in kB */
            sscanf(line + 10, "%lu", &st_memory->tlskb);
        }
        else if (!strncmp(line, "SwapFree:", 9)) {
            /* Read the amount of free swap memory in kB */
            sscanf(line + 9, "%lu", &st_memory->frskb);
        }
        else if (!strncmp(line, "Dirty:", 6)) {
            /* Read the amount of dirty memory in kB */
            sscanf(line + 6, "%lu", &st_memory->dirtykb);
        }
        else if (!strncmp(line, "Committed_AS:", 13)) {
            /* Read the amount of commited memory in kB */
            sscanf(line + 13, "%lu", &st_memory->comkb);
        }
        else if (!strncmp(line, "AnonPages:", 10)) {
            /* Read the amount of pages mapped into userspace page tables in kB */
            sscanf(line + 10, "%lu", &st_memory->anonpgkb);
        }
        else if (!strncmp(line, "Slab:", 5)) {
            /* Read the amount of in-kernel data structures cache in kB */
            sscanf(line + 5, "%lu", &st_memory->slabkb);
        }
        else if (!strncmp(line, "KernelStack:", 12)) {
            /* Read the kernel stack utilization in kB */
            sscanf(line + 12, "%lu", &st_memory->kstackkb);
        }
        else if (!strncmp(line, "PageTables:", 11)) {
            /* Read the amount of memory dedicated to the lowest level of page tables in kB */
            sscanf(line + 11, "%lu", &st_memory->pgtblkb);
        }
        else if (!strncmp(line, "VmallocUsed:", 12)) {
            /* Read the amount of vmalloc area which is used in kB */
            sscanf(line + 12, "%lu", &st_memory->vmusedkb);
        }
    }

    fclose(fp);
}

下图的kbswpfree、kbswpused、kbswpcad通过字面意思即可知其对应的meminfo项为SwapFree、SwapTotal-SwapFree、SwapCached。

3.7 中断统计信息

中断你统计信息同样来自/proc/stat,解析intr字段。第一个是启动以来所有中断触发次数,后面是单个中断触发次数。

生成的表格中包含了对应的图标sum,以及每个中断统计信息。

void read_stat_irq(struct stats_irq *st_irq, int nbr)
{
    FILE *fp;
    struct stats_irq *st_irq_i;
    char line[8192];
    int i, pos;

    if ((fp = fopen(STAT, "r")) == NULL)
        return;

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "intr ", 5)) {-------------------------------------------------/proc/stat的intr字段。
            /* Read total number of interrupts received since system boot */
            sscanf(line + 5, "%llu", &st_irq->irq_nr);
            pos = strcspn(line + 5, " ") + 5;

            for (i = 1; i < nbr; i++) {
                st_irq_i = st_irq + i;
                sscanf(line + pos, " %llu", &st_irq_i->irq_nr);
                pos += strcspn(line + pos + 1, " ") + 1;
            }
        }
    }

    fclose(fp);
}

3.8 

ktables_act

#define FDENTRY_STATE    "/proc/sys/fs/dentry-state"
#define FFILE_NR    "/proc/sys/fs/file-nr"
#define FINODE_STATE    "/proc/sys/fs/inode-state"
#define PTY_NR        "/proc/sys/kernel/pty/nr"

void read_kernel_tables(struct stats_ktables *st_ktables)
{
    FILE *fp;
    unsigned int parm;
    int rc = 0;

    /* Open /proc/sys/fs/dentry-state file */
    if ((fp = fopen(FDENTRY_STATE, "r")) != NULL) {
        rc = fscanf(fp, "%*d %u",
                &st_ktables->dentry_stat);
        fclose(fp);
        if (rc == 0) {
            st_ktables->dentry_stat = 0;
        }
    }

    /* Open /proc/sys/fs/file-nr file */
    if ((fp = fopen(FFILE_NR, "r")) != NULL) {
        rc = fscanf(fp, "%u %u",
                &st_ktables->file_used, &parm);
        fclose(fp);
        /*
         * The number of used handles is the number of allocated ones
         * minus the number of free ones.
         */
        if (rc == 2) {
            st_ktables->file_used -= parm;
        }
        else {
            st_ktables->file_used = 0;
        }
    }

    /* Open /proc/sys/fs/inode-state file */
    if ((fp = fopen(FINODE_STATE, "r")) != NULL) {
        rc = fscanf(fp, "%u %u",
                &st_ktables->inode_used, &parm);
        fclose(fp);
        /*
         * The number of inuse inodes is the number of allocated ones
         * minus the number of free ones.
         */
        if (rc == 2) {
            st_ktables->inode_used -= parm;
        }
        else {
            st_ktables->inode_used = 0;
        }
    }

    /* Open /proc/sys/kernel/pty/nr file */
    if ((fp = fopen(PTY_NR, "r")) != NULL) {
        rc = fscanf(fp, "%u",
                &st_ktables->pty_nr);
        fclose(fp);
        if (rc == 0) {
            st_ktables->pty_nr = 0;
        }
    }
}

3.9 load和queue统计信息

queue_act从/proc/loadavg获取load信息,从/proc/stat获取queue统计信息。

ldavg-1、ldavg-5、ldavg-15分别表示1、5、15分钟内进程对立中平均进程数目,包括正在运行的进程和准备好等待运行的进程数目。

runq-sz表示运行队列大小,plist-sz表示所有进程总数nr_threads。

runq-sz大表示等待运行的进程数目较多。

blocked表示处于iowait状态的进程数目。

void read_loadavg(struct stats_queue *st_queue)
{
    FILE *fp;
    char line[8192];
    int load_tmp[3];
    int rc;

    if ((fp = fopen(LOADAVG, "r")) == NULL)
        return;

    /* Read load averages and queue length */
    rc = fscanf(fp, "%d.%u %d.%u %d.%u %lu/%u %*d\n",------------------------------可以看出从/proc/loadavg中获取信息,然后最后一个数据忽略。
            &load_tmp[0], &st_queue->load_avg_1,
            &load_tmp[1], &st_queue->load_avg_5,
            &load_tmp[2], &st_queue->load_avg_15,
            &st_queue->nr_running,
            &st_queue->nr_threads);

    fclose(fp);

    if (rc < 8)
        return;

    st_queue->load_avg_1  += load_tmp[0] * 100;
    st_queue->load_avg_5  += load_tmp[1] * 100;
    st_queue->load_avg_15 += load_tmp[2] * 100;

    if (st_queue->nr_running) {
        /* Do not take current process into account */
        st_queue->nr_running--;
    }

    /* Read nr of tasks blocked from /proc/stat */
    if ((fp = fopen(STAT, "r")) == NULL)
        return;

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "procs_blocked ", 14)) {--------------------------------从/proc/stat中获取procs_blocked项,表示处于iowait状态的进程数目。
            /* Read number of processes blocked */
            sscanf(line + 14, "%lu", &st_queue->procs_blocked);
            break;
        }
    }

    fclose(fp);
}

3.10 块设备统计信息

disk_act和io_act一样是从/proc/diskstats中获取统计信息,

图标中的四项tps表示块设备读写次数频率;rkB/s和wkB/s表示块设备读写速率;await表示一次读或写耗时。

tps值大说明此段时间读写较频繁。

await值大表示当前读写耗时较大,可能存在问题。

__print_funct_t print_disk_stats(struct activity *a, int prev, int curr,
                 unsigned long long itv)
{
...

    for (i = 0; i < a->nr; i++) {
...
        printf("%-11s", timestamp[curr]);

        cprintf_in(IS_STR, " %9s", dev_name, 0);-----------------------------sdc是设备当前统计信息,sdp是设备前一次统计信息,itv是两次时间间隔。
        cprintf_f(NO_UNIT, 1, 9, 2,
              S_VALUE(sdp->nr_ios, sdc->nr_ios,  itv));----------------------对应图标中的tps,nr_ios表示块设备读写次数之和。tps表示块设备读写频率。
        cprintf_f(unit, 2, 9, 2,
              S_VALUE(sdp->rd_sect, sdc->rd_sect, itv) / 2,
              S_VALUE(sdp->wr_sect, sdc->wr_sect, itv) / 2);-----------------对应图标的rkB/s和wkB/s,同样rd_sect和wr_sect可以计算出以KB为单位的读写速率。
        /* See iostat for explanations */
        cprintf_f(unit, 1, 9, 2,
              xds.arqsz / 2);
        cprintf_f(NO_UNIT, 3, 9, 2,
              S_VALUE(sdp->rq_ticks, sdc->rq_ticks, itv) / 1000.0,
              xds.await,------------------------------------------------------对应await,在compute_ext_disk_stats()中计算。表示每次读写平均耗时。
              xds.svctm);
        cprintf_pc(DISPLAY_UNIT(flags), 1, 9, 2,
               xds.util / 10.0);
        printf("\n");
    }
}

3.11 CPU频率

pwr_cpufreq_act从/proc/cpuinfo中获取频率信息,CPU Frequency all显示的是所有CPU频率的平均值。

从下面代码分析可知,从关键词processor中获取CPU号,从cpu MHz获取CPU频率。

void read_cpuinfo(struct stats_pwr_cpufreq *st_pwr_cpufreq, int nbr)
{
    FILE *fp;
    struct stats_pwr_cpufreq *st_pwr_cpufreq_i;
    char line[1024];
    int nr = 0;
    unsigned int proc_nb = 0, ifreq, dfreq;

    if ((fp = fopen(CPUINFO, "r")) == NULL)
        return;

    st_pwr_cpufreq->cpufreq = 0;

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "processor\t", 10)) {
            sscanf(strchr(line, ':') + 1, "%u", &proc_nb);
        }
        /* Entry in /proc/cpuinfo is different between Intel and Power architectures */
        else if (!strncmp(line, "cpu MHz\t", 8) ||
             !strncmp(line, "clock\t", 6)) {
            sscanf(strchr(line, ':') + 1, "%u.%u", &ifreq, &dfreq);

            if (proc_nb < (nbr - 1)) {
                st_pwr_cpufreq_i = st_pwr_cpufreq + proc_nb + 1;
                st_pwr_cpufreq_i->cpufreq = ifreq * 100 + dfreq / 10;

                st_pwr_cpufreq->cpufreq += st_pwr_cpufreq_i->cpufreq;
                nr++;
            }
            else if (!proc_nb && (nbr == 1)) {
                st_pwr_cpufreq->cpufreq = ifreq * 100 + dfreq / 10;
            }
        }
    }

    fclose(fp);
...
}

3.12 统计串口线信息

serial_act从/proc/tty/driver/serial获取串口线统计信息。

void read_tty_driver_serial(struct stats_serial *st_serial, int nbr)
{
    FILE *fp;
    struct stats_serial *st_serial_i;
    int sl = 0;
    char line[256];
    char *p;

    if ((fp = fopen(SERIAL, "r")) == NULL)
        return;

    while ((fgets(line, sizeof(line), fp) != NULL) && (sl < nbr)) {

        if ((p = strstr(line, "tx:")) != NULL) {
            st_serial_i = st_serial + sl;
            sscanf(line, "%u", &st_serial_i->line);
            /*
             * A value of 0 means an unused structure.
             * So increment it to make sure it is not zero.
             */
            (st_serial_i->line)++;
            /*
             * Read the number of chars transmitted and received by
             * current serial line.
             */
            sscanf(p + 3, "%u", &st_serial_i->tx);
            if ((p = strstr(line, "rx:")) != NULL) {
                sscanf(p + 3, "%u", &st_serial_i->rx);
            }
            if ((p = strstr(line, "fe:")) != NULL) {
                sscanf(p + 3, "%u", &st_serial_i->frame);
            }
            if ((p = strstr(line, "pe:")) != NULL) {
                sscanf(p + 3, "%u", &st_serial_i->parity);
            }
            if ((p = strstr(line, "brk:")) != NULL) {
                sscanf(p + 4, "%u", &st_serial_i->brk);
            }
            if ((p = strstr(line, "oe:")) != NULL) {
                sscanf(p + 3, "%u", &st_serial_i->overrun);
            }

            sl++;
        }
    }

    fclose(fp);
}

3.13 获取网口统计信息

net_dev_act从/proc/net/dev中获取网口的统计信息,从/sys/class/net/xxx/duplex和/sys/class/net/xxx/speed中获取双工和速度信息。

从下面关于/proc/net/dev解释可知,rxpck对应rx_packets,rxpck/s就是每秒接收packet数;txpck/s就是每个发送packet数目。

rxkB/s是每秒接受多少k字节;txkB/s是每秒发送多少k字节。

rxmcst/s表示每秒接收到的multicast packet数目。

 

%ifutil跟每秒发送接收的KB数有关,还跟全双工,单双工有关,以及设备的速度有关;表示网络设备使用率。

双工信息从/sys/class/net/%s/duplex获取,速度信息从从/sys/class/net/%s/speed获取。

从此图看,网络的使用率并不高,说明网络不频繁。

int read_net_dev(struct stats_net_dev *st_net_dev, int nbr)
{
    FILE *fp;
    struct stats_net_dev *st_net_dev_i;
    char line[256];
    char iface[MAX_IFACE_LEN];
    int dev = 0;
    int pos;

    if ((fp = fopen(NET_DEV, "r")) == NULL)----------------------------------------------------------------从/proc/net/dev获取信息。
        return 0;

    while ((fgets(line, sizeof(line), fp) != NULL) && (dev < nbr)) {

        pos = strcspn(line, ":");
        if (pos < strlen(line)) {
            st_net_dev_i = st_net_dev + dev;
            strncpy(iface, line, MINIMUM(pos, MAX_IFACE_LEN - 1));
            iface[MINIMUM(pos, MAX_IFACE_LEN - 1)] = '\0';
            sscanf(iface, "%s", st_net_dev_i->interface); /* Skip heading spaces */
            sscanf(line + pos + 1, "%llu %llu %*u %*u %*u %*u %llu %llu %llu %llu "
                   "%*u %*u %*u %*u %*u %llu",
                   &st_net_dev_i->rx_bytes,
                   &st_net_dev_i->rx_packets,
                   &st_net_dev_i->rx_compressed,
                   &st_net_dev_i->multicast,
                   &st_net_dev_i->tx_bytes,
                   &st_net_dev_i->tx_packets,
                   &st_net_dev_i->tx_compressed);
            dev++;
        }
    }

    fclose(fp);

    return dev;
}

void read_if_info(struct stats_net_dev *st_net_dev, int nbr)
{
    FILE *fp;
    struct stats_net_dev *st_net_dev_i;
    char filename[128], duplex[32];
    int dev, n;

    for (dev = 0; dev < nbr; dev++) {

        st_net_dev_i = st_net_dev + dev;

        /* Read speed info */
        sprintf(filename, IF_DUPLEX, st_net_dev_i->interface);----------------------------------------------从/sys/class/net/%s/duplex获取信息。

        if ((fp = fopen(filename, "r")) == NULL)
            /* Cannot read NIC duplex */
            continue;

        n = fscanf(fp, "%31s", duplex);

        fclose(fp);

        if (n != 1)
            /* Cannot read NIC duplex */
            continue;

        if (!strcmp(duplex, K_DUPLEX_FULL)) {
            st_net_dev_i->duplex = C_DUPLEX_FULL;
        }
        else if (!strcmp(duplex, K_DUPLEX_HALF)) {
            st_net_dev_i->duplex = C_DUPLEX_HALF;
        }
        else
            continue;

        /* Read speed info */
        sprintf(filename, IF_SPEED, st_net_dev_i->interface);--------------------------------------------------从/sys/class/net/%s/speed获取信息。

        if ((fp = fopen(filename, "r")) == NULL)
            /* Cannot read NIC speed */
            continue;

        n = fscanf(fp, "%u", &st_net_dev_i->speed);

        fclose(fp);

        if (n != 1) {
            st_net_dev_i->speed = 0;
        }
    }
}

3.13.1 /proc/net/dev

/proc/net/dev节点在dev_proc_net_init()中创建,对应的fops是dev_seq_fops。

这里重点看一下dev_seq_show():

static int dev_seq_show(struct seq_file *seq, void *v)
{
    if (v == SEQ_START_TOKEN)
        seq_puts(seq, "Inter-|   Receive                            "
                  "                    |  Transmit\n"
                  " face |bytes    packets errs drop fifo frame "
                  "compressed multicast|bytes    packets errs "
                  "drop fifo colls carrier compressed\n");
    else
        dev_seq_printf_stats(seq, v);
    return 0;
}

static void dev_seq_printf_stats(struct seq_file *seq, struct net_device *dev)
{
    struct rtnl_link_stats64 temp;
    const struct rtnl_link_stats64 *stats = dev_get_stats(dev, &temp);

    seq_printf(seq, "%6s: %7llu %7llu %4llu %4llu %4llu %5llu %10llu %9llu "
           "%8llu %7llu %4llu %4llu %4llu %5llu %7llu %10llu\n",
           dev->name, stats->rx_bytes, stats->rx_packets,
           stats->rx_errors,
           stats->rx_dropped + stats->rx_missed_errors,
           stats->rx_fifo_errors,
           stats->rx_length_errors + stats->rx_over_errors +
            stats->rx_crc_errors + stats->rx_frame_errors,
           stats->rx_compressed, stats->multicast,
           stats->tx_bytes, stats->tx_packets,
           stats->tx_errors, stats->tx_dropped,
           stats->tx_fifo_errors, stats->collisions,
           stats->tx_carrier_errors +
            stats->tx_aborted_errors +
            stats->tx_window_errors +
            stats->tx_heartbeat_errors,
           stats->tx_compressed);
}

3.14 网络错误统计信息

net_edev_act从/proc/net/dev网络错误统计信息。

网络错误信息数据来源和网络信息一样,都来自于/proc/net/dev。

关键结构体也是struct rtnl_link_stats64

void read_net_edev(struct stats_net_edev *st_net_edev, int nbr)
{
    FILE *fp;
    struct stats_net_edev *st_net_edev_i;
    static char line[256];
    char iface[MAX_IFACE_LEN];
    int dev = 0;
    int pos;

    if ((fp = fopen(NET_DEV, "r")) == NULL)
        return;

    while ((fgets(line, sizeof(line), fp) != NULL) && (dev < nbr)) {

        pos = strcspn(line, ":");
        if (pos < strlen(line)) {
            st_net_edev_i = st_net_edev + dev;
            strncpy(iface, line, MINIMUM(pos, MAX_IFACE_LEN - 1));
            iface[MINIMUM(pos, MAX_IFACE_LEN - 1)] = '\0';
            sscanf(iface, "%s", st_net_edev_i->interface); /* Skip heading spaces */
            sscanf(line + pos + 1, "%*u %*u %llu %llu %llu %llu %*u %*u %*u %*u "
                   "%llu %llu %llu %llu %llu",
                   &st_net_edev_i->rx_errors,------------------对应rx_erros,表示接受到的bad packet。
                   &st_net_edev_i->rx_dropped,-----------------对应rx_dropped+rx_missed_errors。
                   &st_net_edev_i->rx_fifo_errors,-------------对应rx_fifo_errors。
                   &st_net_edev_i->rx_frame_errors,------------对应rx_length_errors+rx_over_errros。
                   &st_net_edev_i->tx_errors,
                   &st_net_edev_i->tx_dropped,
                   &st_net_edev_i->tx_fifo_errors,
                   &st_net_edev_i->collisions,-----------------对应collisions
                   &st_net_edev_i->tx_carrier_errors);---------包括tx_carrier_errors+tx_aborted_errors+tx_window_errors+tx_heartbeat_errors。
            dev++;
        }
    }

    fclose(fp);
}

获取网络设备统计信息的核心数据结构式struct rtnl_link_stat64

struct rtnl_link_stats64 {
    __u64    rx_packets;        /* total packets received    */
    __u64    tx_packets;        /* total packets transmitted    */
    __u64    rx_bytes;        /* total bytes received     */
    __u64    tx_bytes;        /* total bytes transmitted    */
    __u64    rx_errors;        /* bad packets received        */
    __u64    tx_errors;        /* packet transmit problems    */
    __u64    rx_dropped;        /* no space in linux buffers    */
    __u64    tx_dropped;        /* no space available in linux    */
    __u64    multicast;        /* multicast packets received    */
    __u64    collisions;

    /* detailed rx_errors: */
    __u64    rx_length_errors;
    __u64    rx_over_errors;        /* receiver ring buff overflow    */
    __u64    rx_crc_errors;        /* recved pkt with crc error    */
    __u64    rx_frame_errors;    /* recv'd frame alignment error */
    __u64    rx_fifo_errors;        /* recv'r fifo overrun        */
    __u64    rx_missed_errors;    /* receiver missed packet    */

    /* detailed tx_errors */
    __u64    tx_aborted_errors;
    __u64    tx_carrier_errors;
    __u64    tx_fifo_errors;
    __u64    tx_heartbeat_errors;
    __u64    tx_window_errors;

    /* for cslip etc */
    __u64    rx_compressed;
    __u64    tx_compressed;

    __u64    rx_nohandler;        /* dropped, no handler found    */
};

3.15 NFS客户端统计信息

net_nfs_act读取/proc/net/rpc/nfs来分析作为nfs客户端的统计信息。

void read_net_nfs(struct stats_net_nfs *st_net_nfs)
{
    FILE *fp;
    char line[256];
    unsigned int getattcnt = 0, accesscnt = 0, readcnt = 0, writecnt = 0;

    if ((fp = fopen(NET_RPC_NFS, "r")) == NULL)
        return;

    memset(st_net_nfs, 0, STATS_NET_NFS_SIZE);

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "rpc ", 4)) {
            sscanf(line + 4, "%u %u",
                   &st_net_nfs->nfs_rpccnt, &st_net_nfs->nfs_rpcretrans);
        }
        else if (!strncmp(line, "proc3 ", 6)) {
            sscanf(line + 6, "%*u %*u %u %*u %*u %u %*u %u %u",
                   &getattcnt, &accesscnt, &readcnt, &writecnt);

            st_net_nfs->nfs_getattcnt += getattcnt;
            st_net_nfs->nfs_accesscnt += accesscnt;
            st_net_nfs->nfs_readcnt   += readcnt;
            st_net_nfs->nfs_writecnt  += writecnt;
        }
        else if (!strncmp(line, "proc4 ", 6)) {
            sscanf(line + 6, "%*u %*u %u %u "
                   "%*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %u %u",
                   &readcnt, &writecnt, &accesscnt, &getattcnt);

            st_net_nfs->nfs_getattcnt += getattcnt;
            st_net_nfs->nfs_accesscnt += accesscnt;
            st_net_nfs->nfs_readcnt   += readcnt;
            st_net_nfs->nfs_writecnt  += writecnt;
        }
    }

    fclose(fp);
}

3.16 NFS服务端统计信息

net_nfsd_act

#define NET_RPC_NFSD "/proc/net/rpc/nfsd"

void read_net_nfsd(struct stats_net_nfsd *st_net_nfsd)
{
    FILE *fp;
    char line[256];
    unsigned int getattcnt = 0, accesscnt = 0, readcnt = 0, writecnt = 0;

    if ((fp = fopen(NET_RPC_NFSD, "r")) == NULL)
        return;

    memset(st_net_nfsd, 0, STATS_NET_NFSD_SIZE);

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "rc ", 3)) {
            sscanf(line + 3, "%u %u",
                   &st_net_nfsd->nfsd_rchits, &st_net_nfsd->nfsd_rcmisses);
        }
        else if (!strncmp(line, "net ", 4)) {
            sscanf(line + 4, "%u %u %u",
                   &st_net_nfsd->nfsd_netcnt, &st_net_nfsd->nfsd_netudpcnt,
                   &st_net_nfsd->nfsd_nettcpcnt);
        }
        else if (!strncmp(line, "rpc ", 4)) {
            sscanf(line + 4, "%u %u",
                   &st_net_nfsd->nfsd_rpccnt, &st_net_nfsd->nfsd_rpcbad);
        }
        else if (!strncmp(line, "proc3 ", 6)) {
            sscanf(line + 6, "%*u %*u %u %*u %*u %u %*u %u %u",
                   &getattcnt, &accesscnt, &readcnt, &writecnt);

            st_net_nfsd->nfsd_getattcnt += getattcnt;
            st_net_nfsd->nfsd_accesscnt += accesscnt;
            st_net_nfsd->nfsd_readcnt   += readcnt;
            st_net_nfsd->nfsd_writecnt  += writecnt;

        }
        else if (!strncmp(line, "proc4ops ", 9)) {
            sscanf(line + 9, "%*u %*u %*u %*u %u "
                   "%*u %*u %*u %*u %*u %u "
                   "%*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %u "
                   "%*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %*u %u",
                   &accesscnt, &getattcnt, &readcnt, &writecnt);

            st_net_nfsd->nfsd_getattcnt += getattcnt;
            st_net_nfsd->nfsd_accesscnt += accesscnt;
            st_net_nfsd->nfsd_readcnt   += readcnt;
            st_net_nfsd->nfsd_writecnt  += writecnt;
        }
    }

    fclose(fp);
}

3.17 网络socket统计信息

net_sock_act读取/proc/net/sockstat统计TCP和UDP信息。

/proc/net/sockstat的数据来源于sockstat_seq_show(),这里面统计了当前系统socket的使用情况。

static int sockstat_seq_show(struct seq_file *seq, void *v)
{
    struct net *net = seq->private;
    unsigned int frag_mem;
    int orphans, sockets;

    local_bh_disable();
    orphans = percpu_counter_sum_positive(&tcp_orphan_count);
    sockets = proto_sockets_allocated_sum_positive(&tcp_prot);
    local_bh_enable();

    socket_seq_show(seq);---------------------------------------------------------从sockets_ in_use中获取每个CPU在使用中的socket数目。显示内容sockets: used xxx。tw表示TimeWait的socket数目。
    seq_printf(seq, "TCP: inuse %d orphan %d tw %d alloc %d mem %ld\n",
           sock_prot_inuse_get(net, &tcp_prot), orphans,
           atomic_read(&tcp_death_row.tw_count), sockets,
           proto_memory_allocated(&tcp_prot));------------------------------------第一个参数表示TCP协议使用的socket数目。
    seq_printf(seq, "UDP: inuse %d mem %ld\n",
           sock_prot_inuse_get(net, &udp_prot),
           proto_memory_allocated(&udp_prot));------------------------------------第一个参数表示UDP协议使用的socket数目。
    seq_printf(seq, "UDPLITE: inuse %d\n",
           sock_prot_inuse_get(net, &udplite_prot));
    seq_printf(seq, "RAW: inuse %d\n",
           sock_prot_inuse_get(net, &raw_prot));----------------------------------RAW类型socket使用数目。
    frag_mem = ip_frag_mem(net);
    seq_printf(seq,  "FRAG: inuse %u memory %u\n", !!frag_mem, frag_mem);
    return 0;
}

Sockets图表中的数据totsck、tcpsck、udpsck、rawsck、tcp-tw、ip-frag分别来源于read_net_sock()函数解析。

可见totsck表示系统所有Socket数目,其他子类表示不同类型socket数目。

void read_net_sock(struct stats_net_sock *st_net_sock)
{
    FILE *fp;
    char line[96];
    char *p;

    if ((fp = fopen(NET_SOCKSTAT, "r")) == NULL)
        return;

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "sockets:", 8)) {
            /* Sockets */
            sscanf(line + 14, "%u", &st_net_sock->sock_inuse);
        }
        else if (!strncmp(line, "TCP:", 4)) {
            /* TCP sockets */
            sscanf(line + 11, "%u", &st_net_sock->tcp_inuse);
            if ((p = strstr(line, "tw")) != NULL) {
                sscanf(p + 2, "%u", &st_net_sock->tcp_tw);
            }
        }
        else if (!strncmp(line, "UDP:", 4)) {
            /* UDP sockets */
            sscanf(line + 11, "%u", &st_net_sock->udp_inuse);
        }
        else if (!strncmp(line, "RAW:", 4)) {
            /* RAW sockets */
            sscanf(line + 11, "%u", &st_net_sock->raw_inuse);
        }
        else if (!strncmp(line, "FRAG:", 5)) {
            /* FRAGments */
            sscanf(line + 12, "%u", &st_net_sock->frag_inuse);
        }
    }

    fclose(fp);
}

3.18 huge页面统计信息

huge_act通过读取/proc/meminfo获取huge页面统计信息。

void read_meminfo_huge(struct stats_huge *st_huge)
{
    FILE *fp;
    char line[128];
    unsigned long szhkb = 0;

    if ((fp = fopen(MEMINFO, "r")) == NULL)
        return;

    while (fgets(line, sizeof(line), fp) != NULL) {

        if (!strncmp(line, "HugePages_Total:", 16)) {
            /* Read the total number of huge pages */
            sscanf(line + 16, "%lu", &st_huge->tlhkb);
        }
        else if (!strncmp(line, "HugePages_Free:", 15)) {
            /* Read the number of free huge pages */
            sscanf(line + 15, "%lu", &st_huge->frhkb);
        }
        else if (!strncmp(line, "Hugepagesize:", 13)) {
            /* Read the default size of a huge page in kB */
            sscanf(line + 13, "%lu", &szhkb);
        }
    }

    fclose(fp);

    /* We want huge pages stats in kB and not expressed in a number of pages */
    st_huge->tlhkb *= szhkb;
    st_huge->frhkb *= szhkb;
}

3.19 softnet统计信息

softnet_act通过/proc/net/softnet_stat获取统计信息,

void read_softnet(struct stats_softnet *st_softnet, int nbr)
{
    FILE *fp;
    struct stats_softnet *st_softnet_i;
    char line[1024];
    unsigned int proc_nb = 1;

    /* Open /proc/net/softnet_stat file */
    if ((fp = fopen(NET_SOFTNET, "r")) == NULL)
        return;

    /*
     * Init a structure that will contain the values for CPU "all".
     * CPU "all" doesn't exist in /proc/net/softnet_stat file, so
     * we compute its values as the sum of the values of each CPU.
     */
    memset(st_softnet, 0, sizeof(struct stats_softnet));

    while ((fgets(line, sizeof(line), fp) != NULL) && (proc_nb < nbr)) {

        st_softnet_i = st_softnet + proc_nb++;
        sscanf(line, "%x %x %x %*x %*x %*x %*x %*x %*x %x %x",
               &st_softnet_i->processed,
               &st_softnet_i->dropped,
               &st_softnet_i->time_squeeze,
               &st_softnet_i->received_rps,
               &st_softnet_i->flow_limit);

        st_softnet->processed += st_softnet_i->processed;
        st_softnet->dropped += st_softnet_i->dropped;
        st_softnet->time_squeeze += st_softnet_i->time_squeeze;
        st_softnet->received_rps += st_softnet_i->received_rps;
        st_softnet->flow_limit += st_softnet_i->flow_limit;
    }

    fclose(fp);
}

/proc/net/softnet_stat数据来源于softnet_seq_show()。

static int softnet_seq_show(struct seq_file *seq, void *v)
{
    struct softnet_data *sd = v;
    unsigned int flow_limit_count = 0;

#ifdef CONFIG_NET_FLOW_LIMIT
    struct sd_flow_limit *fl;

    rcu_read_lock();
    fl = rcu_dereference(sd->flow_limit);
    if (fl)
        flow_limit_count = fl->count;
    rcu_read_unlock();
#endif

    seq_printf(seq,
           "%08x %08x %08x %08x %08x %08x %08x %08x %08x %08x %08x\n",
           sd->processed, sd->dropped, sd->time_squeeze, 0,
           0, 0, 0, 0, /* was fastroute */
           0,    /* was cpu_collision */
           sd->received_rps, flow_limit_count);
    return 0;
}

 

posted on 2018-12-04 20:58  ArnoldLu  阅读(3103)  评论(0编辑  收藏  举报

导航