Get in Touch
Our team is happy to answer your questions. Please fill out the form and we will be in touch with you as soon as possible.
DL Accelerator Kernel & Tools


Delivering SDK library components for CPU, GPU, DSP, DL accelerator
MulticoreWare engineering consulting services include low level software like Kernel development, porting kernels, profiling & optimizing kernels, memory planning, instructions pipelining to middle level software like CNN framework parsers, CNN model format converters, model compression & optimization etc. With vast array of hardware micro-architecture (CPU, GPU, DSP, DL HW accelerator) awareness and its experience to deliver consulting services uniquely positions MulticoreWare to take up any general purpose as well as custom hardware accelerator that are targeted for AI/ML/DL applications.
Examples of areas research, development, and solutions delivered include:
- Kernel optimization with SIMD intrinsic
- Kernel designs to achieve CPP (Cycles per Pixels) spec
- Kernel optimization for various NN kernels (data-bound & compute-bound) on various DSP micro-architecture
- OpenVX, OpenCV, OpenMP and OpenCL libraries to various DL accelerator platforms
- End-to-end pipeline creation for embedded computer vision (CV) functions
- Parallel computing, pipeline stalls, Runtime engine optimization etc.
- End-to-end inference pipeline (Inception-V3, Yolo-v2/v3 etc.)
- High Performance workload benchmarking

Get in touch
Our team is happy to answer your questions. Please fill out the form and we will be in touch with you as soon as possible.
News & Updates

Join MulticoreWare at Detroit for AutoSens2022 this May at Michigan Science Center
Join us at Detroit for AutoSens2022 this May This May join Multicoreware for AutoSens2022 Detroit at the Michigan... Read more
MulticoreWare Inc.’s VVC Consortium Gains Momentum
SAN JOSE, Calif. 23 April 2022–(BUSINESS WIRE)–MulticoreWare has been leading the x266 open-source encoder project in the past year... Read more