AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU)...

Last updated: 6 days ago

libfabric

Open Fabric Interfaces

Last updated: 6 days ago

MITuna

Last updated: 6 days ago

pytorch_scatter

PyTorch Extension Library of Optimized Scatter Operations

Last updated: 6 days ago

pytorch-lightning

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Last updated: 6 days ago

recommenders-addons

Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.

Last updated: 6 days ago

rtg_tracer

Last updated: 6 days ago

kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

Last updated: 6 days ago

rocWMMA

rocWMMA

Last updated: 6 days ago

composable_kernel_remove

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

Last updated: 6 days ago

sukha-tools

Profiling tools intended for Nvidia or AMD GPUs

Last updated: 6 days ago

hipify_torch

Last updated: 6 days ago

triton

Development repository for the Triton language and compiler

Last updated: 6 days ago

gpufort

GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify

Last updated: 6 days ago

tensorflow-build

Build-related tools for TensorFlow

Last updated: 6 days ago

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Last updated: 6 days ago

ClassyVision

An end-to-end PyTorch framework for image and video classification

Last updated: 6 days ago

hipFFT

hipFFT is a FFT marshalling library.

Last updated: 6 days ago

tensorflow-upstream

TensorFlow ROCm port

Last updated: 6 days ago

bert

TensorFlow code and pre-trained models for BERT

Last updated: 6 days ago
Achievement
2
Star
2
Fork
People(1)
镜像

Search