Loading...
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 | ====================================== HNS3 Performance Monitoring Unit (PMU) ====================================== HNS3(HiSilicon network system 3) Performance Monitoring Unit (PMU) is an End Point device to collect performance statistics of HiSilicon SoC NIC. On Hip09, each SICL(Super I/O cluster) has one PMU device. HNS3 PMU supports collection of performance statistics such as bandwidth, latency, packet rate and interrupt rate. Each HNS3 PMU supports 8 hardware events. HNS3 PMU driver =============== The HNS3 PMU driver registers a perf PMU with the name of its sicl id.:: /sys/devices/hns3_pmu_sicl_<sicl_id> PMU driver provides description of available events, filter modes, format, identifier and cpumask in sysfs. The "events" directory describes the event code of all supported events shown in perf list. The "filtermode" directory describes the supported filter modes of each event. The "format" directory describes all formats of the config (events) and config1 (filter options) fields of the perf_event_attr structure. The "identifier" file shows version of PMU hardware device. The "bdf_min" and "bdf_max" files show the supported bdf range of each pmu device. The "hw_clk_freq" file shows the hardware clock frequency of each pmu device. Example usage of checking event code and subevent code:: $# cat /sys/devices/hns3_pmu_sicl_0/events/dly_tx_normal_to_mac_time config=0x00204 $# cat /sys/devices/hns3_pmu_sicl_0/events/dly_tx_normal_to_mac_packet_num config=0x10204 Each performance statistic has a pair of events to get two values to calculate real performance data in userspace. The bits 0~15 of config (here 0x0204) are the true hardware event code. If two events have same value of bits 0~15 of config, that means they are event pair. And the bit 16 of config indicates getting counter 0 or counter 1 of hardware event. After getting two values of event pair in userspace, the formula of computation to calculate real performance data is::: counter 0 / counter 1 Example usage of checking supported filter mode:: $# cat /sys/devices/hns3_pmu_sicl_0/filtermode/bw_ssu_rpu_byte_num filter mode supported: global/port/port-tc/func/func-queue/ Example usage of perf:: $# perf list hns3_pmu_sicl_0/bw_ssu_rpu_byte_num/ [kernel PMU event] hns3_pmu_sicl_0/bw_ssu_rpu_time/ [kernel PMU event] ------------------------------------------ $# perf stat -g -e hns3_pmu_sicl_0/bw_ssu_rpu_byte_num,global=1/ -e hns3_pmu_sicl_0/bw_ssu_rpu_time,global=1/ -I 1000 or $# perf stat -g -e hns3_pmu_sicl_0/config=0x00002,global=1/ -e hns3_pmu_sicl_0/config=0x10002,global=1/ -I 1000 Filter modes -------------- 1. global mode PMU collect performance statistics for all HNS3 PCIe functions of IO DIE. Set the "global" filter option to 1 will enable this mode. Example usage of perf:: $# perf stat -a -e hns3_pmu_sicl_0/config=0x1020F,global=1/ -I 1000 2. port mode PMU collect performance statistic of one whole physical port. The port id is same as mac id. The "tc" filter option must be set to 0xF in this mode, here tc stands for traffic class. Example usage of perf:: $# perf stat -a -e hns3_pmu_sicl_0/config=0x1020F,port=0,tc=0xF/ -I 1000 3. port-tc mode PMU collect performance statistic of one tc of physical port. The port id is same as mac id. The "tc" filter option must be set to 0 ~ 7 in this mode. Example usage of perf:: $# perf stat -a -e hns3_pmu_sicl_0/config=0x1020F,port=0,tc=0/ -I 1000 4. func mode PMU collect performance statistic of one PF/VF. The function id is BDF of PF/VF, its conversion formula:: func = (bus << 8) + (device << 3) + (function) for example: BDF func 35:00.0 0x3500 35:00.1 0x3501 35:01.0 0x3508 In this mode, the "queue" filter option must be set to 0xFFFF. Example usage of perf:: $# perf stat -a -e hns3_pmu_sicl_0/config=0x1020F,bdf=0x3500,queue=0xFFFF/ -I 1000 5. func-queue mode PMU collect performance statistic of one queue of PF/VF. The function id is BDF of PF/VF, the "queue" filter option must be set to the exact queue id of function. Example usage of perf:: $# perf stat -a -e hns3_pmu_sicl_0/config=0x1020F,bdf=0x3500,queue=0/ -I 1000 6. func-intr mode PMU collect performance statistic of one interrupt of PF/VF. The function id is BDF of PF/VF, the "intr" filter option must be set to the exact interrupt id of function. Example usage of perf:: $# perf stat -a -e hns3_pmu_sicl_0/config=0x00301,bdf=0x3500,intr=0/ -I 1000 |