Geoffrey McRae
|
e8b1b8fbdf
|
[common] tune windows memcpySSE asm implementation:wq
|
2018-05-19 21:40:13 +10:00 |
|
Geoffrey McRae
|
e9d77e6c52
|
[common] inline memcpy into memcpySSE for the final bytes
|
2018-05-19 18:31:49 +10:00 |
|
Geoffrey McRae
|
56f0a8525b
|
[common] more SSE improvements
* 32bit inlined is slow for only large copies, warn if memcpySSE is
used when it shouldn't be.
* Removed 64bit memcpySSE as native inlined is faster
See: https://stackoverflow.com/questions/50422510/why-is-i386-memcpy-slow-on-x86-64
|
2018-05-19 18:27:04 +10:00 |
|
Geoffrey McRae
|
778af24d82
|
[common] inline get_pc to memcpySSE
|
2018-05-19 16:21:12 +10:00 |
|
Geoffrey McRae
|
cd6caea4b0
|
[x86] use a proper call/ret to obtain the current IP
See: https://blogs.msdn.microsoft.com/oldnewthing/20041216-00/?p=36973
|
2018-05-19 16:16:01 +10:00 |
|
Geoffrey McRae
|
f63c8043af
|
[common] new sse2 memcpy improvements
|
2018-05-18 20:56:57 +10:00 |
|
Geoffrey McRae
|
3c77c1eb2b
|
NASM version of a SSE2 memcpy
|
2018-05-18 18:50:07 +10:00 |
|
Geoffrey McRae
|
ffec6c2014
|
Incoming new memcpy implementation
|
2018-05-18 01:59:00 +10:00 |
|