Skip to content
Change the repository type filter

All

    Repositories list

    • DeepHYDRA

      Public
      Python
      Apache License 2.0
      0110Updated Aug 27, 2024Aug 27, 2024
    • Rapid Partitioning-based Deformable Image Registration on Multi-GPU Accelerator
      Shell
      GNU General Public License v3.0
      0200Updated Aug 4, 2024Aug 4, 2024
    • Python
      Apache License 2.0
      0000Updated Jun 28, 2024Jun 28, 2024
    • Python
      MIT License
      1100Updated Jun 5, 2024Jun 5, 2024
    • Python
      0000Updated Apr 22, 2024Apr 22, 2024
    • galen

      Public
      Galen: Hardware-specific Automatic Compression of Neural Networks
      Python
      MIT License
      1001Updated Jul 24, 2023Jul 24, 2023
    • CUDAsap

      Public
      C++
      MIT License
      0000Updated Mar 17, 2023Mar 17, 2023
    • Build repo for PYNQ on the ZCU216 RFSOC
      Shell
      15000Updated Apr 28, 2022Apr 28, 2022
    • Simulator for memory access patterns of FPGA-based graph processing accelerators
      0410Updated Dec 22, 2021Dec 22, 2021
    • Tutorial files for ''Instrumentation and Modeling of Performance and Power Consumption for Massively Parallel Processors'' at HiPEAC 2021 Conference -- https://www.hipeac.net/2021/spring-virtual/#/program/sessions/7856/
      Jupyter Notebook
      0000Updated Dec 17, 2021Dec 17, 2021
    • cuda-flux

      Public
      CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels
      C++
      MIT License
      63140Updated Mar 15, 2021Mar 15, 2021
    • machine learning model for execution time and power prediction of CUDA kernels
      Jupyter Notebook
      6400Updated Jan 25, 2021Jan 25, 2021
    • arm-peak

      Public
      Measure computational peak performance on embedded ARM processors.
      C++
      MIT License
      0100Updated Nov 27, 2020Nov 27, 2020
    • camuy

      Public
      Fast evaluation of CNNs on configurable systolic arrays based on abstract metrics
      C++
      MIT License
      7300Updated Aug 11, 2020Aug 11, 2020
    • LLVM Plugin to Instrument Global Memory Accesses in CUDA Kernels
      LLVM
      Other
      2810Updated Jun 8, 2020Jun 8, 2020
    • Automatically partitioning compiler for CUDA (WIP) based on the LLVM infrastructure.
      C++
      MIT License
      1600Updated Jul 19, 2019Jul 19, 2019
    • Simple tool to measure the average and worst case resolution of the gettimeofday call.
      C
      MIT License
      0100Updated Jul 3, 2019Jul 3, 2019
    • A no-dependency python library (with okayish performance) for sets of interval ranges
      Python
      0000Updated Jan 25, 2019Jan 25, 2019
    • ECML2018

      Public
      Python
      0000Updated Jul 4, 2018Jul 4, 2018
    • MScTI_APC

      Public
      Accompanying material for course "Advanced Parallel Computing", Institute of Computer Engineering, Ruprecht-Karls University of Heidelberg, Germany
      C
      BSD 3-Clause "New" or "Revised" License
      2400Updated Jul 26, 2017Jul 26, 2017
    • sonar

      Public
      Trace analysis utility for OTF traces
      C++
      0100Updated Mar 15, 2017Mar 15, 2017
    • A GPU-based Graph500 implementation providing compressed data movements.
      C++
      2510Updated Jun 29, 2016Jun 29, 2016