opencv/modules
Vitaly Tuzov 3b015dfc7d Merge pull request #14210 from terfendail:wui_512
AVX512 wide universal intrinsics (#14210)

* Added implementation of 512-bit wide universal intrinsics(WIP)

* Added implementation of 512-bit wide universal intrinsics: implemented WUI vector types(WIP)

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented load/store

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented fp16 load/store

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented recombine and zip, implemented non-saturating and saturating arithmetics

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented bit operations

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented comparisons

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented lane shifts and reduction

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented absolute values

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented rounding and cast to float

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented LUT

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented type extension/narrowing and matrix operations

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented load_deinterleave for 2 and 3 channels images

* Added implementation of 512-bit wide universal intrinsics(WIP): reimplemented load_deinterleave for 2- and implemented for 4-channel images

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented store_interleave

* Added implementation of 512-bit wide universal intrinsics(WIP): implemented signmask and checks

* Added implementation of 512-bit wide universal intrinsics(WIP): build fixes

* Added implementation of 512-bit wide universal intrinsics(WIP): reimplemented popcount in case AVX512_BITALG is unavailable

* Added implementation of 512-bit wide universal intrinsics(WIP): reimplemented zip

* Added implementation of 512-bit wide universal intrinsics(WIP): reimplemented rotate for s8 and s16

* Added implementation of 512-bit wide universal intrinsics(WIP): reimplemented interleave/deinterleave for s8 and s16

* Added implementation of 512-bit wide universal intrinsics(WIP): updated v512_set macros

* Added implementation of 512-bit wide universal intrinsics(WIP): fix for GCC wrong _mm512_abs_pd definition

* Added implementation of 512-bit wide universal intrinsics(WIP): reworked v_zip to avoid AVX512_VBMI intrinsics

* Added implementation of 512-bit wide universal intrinsics(WIP): reworked v_invsqrt to avoid AVX512_ER intrinsics

* Added implementation of 512-bit wide universal intrinsics(WIP): reworked v_rotate, v_popcount and interleave/deinterleave for U8 to avoid AVX512_VBMI intrinsics

* Added implementation of 512-bit wide universal intrinsics(WIP): fixed integral image SIMD part

* Added implementation of 512-bit wide universal intrinsics(WIP): fixed warnings

* Added implementation of 512-bit wide universal intrinsics(WIP): fixed load_deinterleave for u8 and u16

* Added implementation of 512-bit wide universal intrinsics(WIP): fixed v_invsqrt accuracy for f64

* Added implementation of 512-bit wide universal intrinsics(WIP): fixed interleave/deinterleave for u32 and u64

* Added implementation of 512-bit wide universal intrinsics(WIP): fixed interleave_pairs, interleave_quads and pack_triplets

* Added implementation of 512-bit wide universal intrinsics(WIP): fixed rotate_left

* Added implementation of 512-bit wide universal intrinsics(WIP): fixed rotate_left/right, part 2

* Added implementation of 512-bit wide universal intrinsics(WIP): fixed 512-wide universal intrinsics based resize

* Added implementation of 512-bit wide universal intrinsics(WIP): fixed findContours by avoiding use of uint64 dependent 512-wide v_signmask()

* Added implementation of 512-bit wide universal intrinsics(WIP): fixed trailing whitespaces

* Added implementation of 512-bit wide universal intrinsics(WIP): reworked specific intrinsic sets dependent parts to check availability of intrinsics based on CPU feature group defines

* Added implementation of 512-bit wide universal intrinsics(WIP):Updated AVX512 implementation of v_popcount to avoid AVX512VPOPCNTDQ intrinsics if unavailable.

* Added implementation of 512-bit wide universal intrinsics(WIP): Fixed universal intrinsics data initialisation, v_mul_wrap, v_floor, v_ceil and v_signmask.

* Added implementation of 512-bit wide universal intrinsics(WIP): Removed hasSIMD512()

* Added implementation of 512-bit wide universal intrinsics(WIP): Fixes for gcc build

* Added implementation of 512-bit wide universal intrinsics(WIP): Reworked v_signmask, v_check_any() and v_check_all() implementation.
2019-06-03 18:05:35 +03:00
..
calib3d Merge pull request #14447 from catree:fix_issue_14423 2019-05-27 15:00:21 +00:00
core Merge pull request #14210 from terfendail:wui_512 2019-06-03 18:05:35 +03:00
cudaarithm Move Ptr-related code from lut.cu to lut.cpp 2019-03-13 20:25:28 +09:00
cudabgsegm
cudacodec
cudafeatures2d
cudafilters Merge pull request #13695 from flyingfalling:3.4 2019-03-13 20:53:59 +03:00
cudaimgproc Add CV_16UC1 support for cuda::CLAHE 2019-02-06 17:21:55 +00:00
cudalegacy cudalegacy: Use safe block scan function 2019-02-13 01:55:02 +09:00
cudaobjdetect
cudaoptflow
cudastereo cuda::StereoBM - fix hanging and racing issue 2019-02-18 06:03:12 +00:00
cudawarping
cudev CUDA 10.1 Build Issue Fix 2019-03-03 16:40:43 +00:00
dnn Merge pull request #14662 from dkurt:dnn_tf_fix_identity_switch 2019-05-30 20:22:17 +00:00
features2d build: avoid highgui dependency 2019-04-15 18:59:51 +00:00
flann flann/kmeans: Fix non-determinism of KMeans index 2019-03-20 08:44:31 -07:00
highgui cmake: use absolute library paths from 'pkgconfig' 2019-04-15 22:11:49 +00:00
imgcodecs Merge pull request #14622 from asashour:junit 2019-05-27 14:55:45 +00:00
imgproc Merge pull request #14210 from terfendail:wui_512 2019-06-03 18:05:35 +03:00
java Merge pull request #14606 from asashour:java_inline_return 2019-05-29 20:03:54 +00:00
js js: added floodFill function to JS binding 2019-05-21 14:19:50 +02:00
ml fix a typo in md file 2019-05-01 14:05:33 +02:00
objdetect java: test: use assertNotNull and assertFalse 2019-05-24 10:45:09 +02:00
photo Merge pull request #13655 from sturkmen72:update_doc_photo 2019-03-13 17:57:17 +00:00
python python: discover tests from module/misc/python/test paths 2019-04-10 18:35:35 +00:00
shape python: discover tests from module/misc/python/test paths 2019-04-10 18:35:35 +00:00
stitching python: discover tests from module/misc/python/test paths 2019-04-10 18:35:35 +00:00
superres
ts java(test): test package filtering 2019-05-26 15:12:05 +00:00
video python: discover tests from module/misc/python/test paths 2019-04-10 18:35:35 +00:00
videoio Merge pull request #14592 from aleksey-nikolaev:master 2019-05-27 19:18:36 +03:00
videostab
viz
world
CMakeLists.txt