subject:"\[dpdk\-dev\] \[PATCH v2\] eal\: optimize aligned rte_memcpy"

Re: [dpdk-dev] [PATCH v2] eal: optimize aligned rte_memcpy

2017-01-17 Thread Thomas Monjalon

2016-12-08 10:18, Yuanhan Liu: > On Tue, Dec 06, 2016 at 08:31:06PM -0500, Zhihong Wang wrote: > > This patch optimizes rte_memcpy for well aligned cases, where both > > dst and src addr are aligned to maximum MOV width. It introduces a > > dedicated function called rte_memcpy_aligned to handle the

Re: [dpdk-dev] [PATCH v2] eal: optimize aligned rte_memcpy

2016-12-07 Thread Yuanhan Liu

On Tue, Dec 06, 2016 at 08:31:06PM -0500, Zhihong Wang wrote: > This patch optimizes rte_memcpy for well aligned cases, where both > dst and src addr are aligned to maximum MOV width. It introduces a > dedicated function called rte_memcpy_aligned to handle the aligned > cases with simplified instru

Re: [dpdk-dev] [PATCH v2] eal: optimize aligned rte_memcpy

2016-12-07 Thread Yao, Lei A

Tested-by: Lei Yao - Apply patch to v16.11 I have tested the loopback performance for this patch on 3 following settings: CPU: IVB Ubutnu16.04 Kernal: 4.4.0 gcc : 5.4.0 CPU: HSW Fedora 21 Kernal: 4.1.13 gcc: 4.9.2 CPU:BDW Ubutnu16.04 Kernal: 4.4.0 gcc : 5.4.0 I can see 10%~20% performance gai

[dpdk-dev] [PATCH v2] eal: optimize aligned rte_memcpy

2016-12-07 Thread Zhihong Wang

This patch optimizes rte_memcpy for well aligned cases, where both dst and src addr are aligned to maximum MOV width. It introduces a dedicated function called rte_memcpy_aligned to handle the aligned cases with simplified instruction stream. The existing rte_memcpy is renamed as rte_memcpy_generic