linux/arch/openrisc/include/asm
Stafford Horne f5d45dc911 openrisc: Add optimized memcpy routine
The generic memcpy routine provided in kernel does only byte copies.
Using word copies we can lower boot time and cycles spend in memcpy
quite significantly.

Booting on my de0 nano I see boot times go from 7.2 to 5.6 seconds.
The avg cycles in memcpy during boot go from 6467 to 1887.

I tested several algorithms (see code in previous patch mails)

The implementations I tested and avg cycles:
  - Word Copies + Loop Unrolls + Non Aligned    1882
  - Word Copies + Loop Unrolls                  1887
  - Word Copies                                 2441
  - Byte Copies + Loop Unrolls                  6467
  - Byte Copies                                 7600

In the end I ended up going with Word Copies + Loop Unrolls as it
provides best tradeoff between simplicity and boot speedups.

Signed-off-by: Stafford Horne <shorne@gmail.com>
2017-02-25 04:14:36 +09:00
..
bitops openrisc: add atomic bitops 2017-02-25 04:12:38 +09:00
asm-offsets.h OpenRISC: Headers 2011-07-22 18:46:39 +02:00
atomic.h openrisc: add optimized atomic operations 2017-02-25 04:14:06 +09:00
bitops.h openrisc: add atomic bitops 2017-02-25 04:12:38 +09:00
cache.h openrisc: Define __ro_after_init to avoid crash 2016-11-06 08:01:12 -08:00
cmpxchg.h openrisc: add cmpxchg and xchg implementations 2017-02-25 04:14:00 +09:00
cpuinfo.h openrisc: add cache way information to cpuinfo 2017-02-06 21:50:41 +09:00
delay.h OpenRISC: Headers 2011-07-22 18:46:39 +02:00
dma-mapping.h dma-mapping: always provide the dma_map_ops based implementation 2016-01-20 17:09:18 -08:00
elf.h default SET_PERSONALITY() in linux/elf.h 2013-02-26 02:46:08 -05:00
fixmap.h
futex.h openrisc: add futex_atomic_* implementations 2017-02-25 04:14:23 +09:00
io.h asm-generic/io.h: remove asm/cacheflush.h include 2012-10-25 16:06:57 +02:00
irq.h openrisc: Get rid of handle_IRQ 2014-09-03 13:11:02 +00:00
irqflags.h OpenRISC: IRQ 2011-07-22 18:46:33 +02:00
Kbuild openrisc: add futex_atomic_* implementations 2017-02-25 04:14:23 +09:00
linkage.h OpenRISC: Headers 2011-07-22 18:46:39 +02:00
mmu_context.h
mmu.h
page.h openrisc: drop wrongly typed definition of page_to_virt() 2016-04-22 10:08:34 +01:00
pgalloc.h openrisc: Consolidate setup to use memblock instead of bootmem 2016-12-12 23:10:00 +09:00
pgtable.h openrisc: fix PTRS_PER_PGD define 2016-12-12 23:09:06 +09:00
processor.h locking/core: Provide common cpu_relax_yield() definition 2016-11-17 08:17:36 +01:00
ptrace.h UAPI: (Scripted) Disintegrate arch/openrisc/include/asm 2012-10-09 09:47:18 +01:00
serial.h OpenRISC: Headers 2011-07-22 18:46:39 +02:00
spinlock.h OpenRISC: Headers 2011-07-22 18:46:39 +02:00
spr_defs.h openrisc: Fix the bitmask for the unit present register 2017-02-25 04:14:34 +09:00
spr.h OpenRISC: Headers 2011-07-22 18:46:39 +02:00
string.h openrisc: Add optimized memcpy routine 2017-02-25 04:14:36 +09:00
syscall.h ARCH: AUDIT: implement syscall_get_arch for all arches 2014-09-23 16:20:10 -04:00
syscalls.h openrisc: switch to use of generic fork and clone 2012-11-28 23:43:40 -05:00
thread_info.h arch: Remove exec_domain from remaining archs 2015-04-12 21:03:30 +02:00
timex.h OpenRISC: Timekeeping 2011-07-22 18:46:32 +02:00
tlb.h
tlbflush.h
uaccess.h exceptions: detritus removal 2016-09-27 21:15:14 -04:00
unaligned.h OpenRISC: Headers 2011-07-22 18:46:39 +02:00