Projects with this topic
-
https://github.com/vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Updated -
https://github.com/vllm-project/vllm-ascend Community maintained hardware plugin for vLLM on AscendUpdated -
-
https://github.com/takara-ai/go-attentionA full attention mechanism and transformer in pure go.
Updated -
https://github.com/InternLM/MindSearch
An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) https://mindsearch.netlify.app/Updated -
huggingface.co/transformers https://github.com/huggingface/transformers Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.Updated -
https://github.com/THUDM/SwissArmyTransformer SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
Updated -
https://github.com/gpustack/llama-boxLM inference server implementation based on llama.cpp.
Updated -
https://github.com/forhaoliu/instructrlInstruction Following Agents with Multimodal Transforemrs
This is a Jax implementation for the InstructRL method.
Updated -
https://github.com/THUDM/CogView2 official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"
Updated