You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The sizeof 'struct ibv_qp' allocated by ibv_create_qp is 160.
If the memory holds the 'struct ibv_qp' was allocated at the
upper boundary of a memory page, cast it to 'struct verbs_qp',
whose size is 360, may across the memory page boundary. It will
trigger invalid memory access to the next memory page.
The issue can be reproduced with OPA and QIB HCA.
For example run over OPA:
Server Node: $ ib_read_bw -F -N -n 1000 -u 20 -q 257 -s 4194304
Client Node: $ ib_read_bw -F -N -n 1000 -u 20 -q 257 -s 4194304 <sever>
Program received signal SIGSEGV, Segmentation fault.
ibv_qp_to_qp_ex (qp=0x5555557a5f10) at libibverbs/verbs.c:624
624 if (vqp->comp_mask & VERBS_QP_EX)
(gdb) bt
#0 ibv_qp_to_qp_ex (qp=0x5555557a5f10) at libibverbs/verbs.c:624
#1 0x000055555556af4a in create_reg_qp_main (ctx=ctx@entry=0x7fffffffd500, user_param=user_param@entry=0x7fffffffd670, i=i@entry=21, num_of_qps=num_of_qps@entry=128) at src/perftest_resources.c:1597
#2 0x000055555556b6d7 in create_qp_main (num_of_qps=<optimized out>, i=21, user_param=0x7fffffffd670, ctx=0x7fffffffd500) at src/perftest_resources.c:1613
#3 ctx_init (ctx=0x7fffffffd500, user_param=0x7fffffffd670) at src/perftest_resources.c:1552
#4 0x0000555555558e9c in main (argc=<optimized out>, argv=<optimized out>) at src/read_bw.c:149
624 if (vqp->comp_mask & VERBS_QP_EX)
(gdb) p qp
$1 = (struct ibv_qp *) 0x5555557a5f10
(gdb) p vqp
$2 = (struct verbs_qp *) 0x5555557a5f10
(gdb) p *qp
$3 = {context = 0x55555578ad00, qp_context = 0x0, ....
(gdb) p *vqp
Cannot access memory at address 0x5555557a6000
Signed-off-by: Honggang Li <[email protected]>
Hi,
I'm trying to run a latency test for the first time but I get an error when I try to execute it.
My steps:
Downloaded and unzipped the repo on the box on which I want to run the server for the test
Executed:
./autogen
./configure
make clean && make V=1
./ib_send_lat --duration=30 -H
What am I doing wrong?
Running other tests using RDMA (eg. the ones in Accelio) the system shows no problems.
Thanks
The text was updated successfully, but these errors were encountered: