Friedrich Marty's picture

Friedrich Marty

Smorty100

AI & ML interests

I'm most interested in content rerouting between LLM and VLLM agens for automation possibilities. Using templates for each agent which is then filled in by another agents inputs seems really useful.

Recent Activity

reacted to Abhaykoul's post with πŸ€— 1 day ago
πŸš€ Ever dreamed of training your own Large Language Model from scratch? What if I told you it doesn't require a supercomputer or PhD in ML? 🀯 Introducing LLM Trainer - the educational framework that makes LLM training accessible to EVERYONE! Whether you're on a CPU-only laptop or scaling to distributed GPUs, we've got you covered. πŸ’»βž‘οΈπŸ–₯️ Why LLM Trainer? Because existing tools are either too simplistic (hiding the magic) or too complex (requiring expert knowledge). We bridge the gap with: πŸŽ“ Educational transparency - every component built from scratch with clear code πŸ’» CPU-first approach - start training immediately, no GPU needed πŸ”§ Full customization - modify anything you want πŸ“ˆ Seamless scaling - from laptop to cluster without code changes 🀝 HuggingFace integration - works with existing models & tokenizers Key highlights: βœ… Built-in tokenizers (BPE, WordPiece, HF wrappers) βœ… Complete Transformer implementation from scratch βœ… Optimized for CPU training βœ… Advanced features: mixed precision, gradient checkpointing, multiple generation strategies βœ… Comprehensive monitoring & metrics Perfect for: - Students learning transformers - Researchers prototyping new ideas - Developers building domain-specific models Ready to train your first LLM? It's easier than you think! πŸ”— Check it out: https://github.com/HelpingAI/llm-trainer πŸ“š Docs: Getting Started Guide πŸ’¬ Join the community: GitHub Discussions #AI #MachineLearning #LLM #DeepLearning #OpenSource #Python #HuggingFace #NLP Special thanks to HuggingFace and PyTorch teams for the amazing ecosystem! πŸ™
liked a Space 3 days ago
vectara/leaderboard
View all activity

Organizations

None yet