Decoder and Encoder LLM Models

Rockchip RK1820/RK1828 SO-DIMM and M.2 LLM/VLM AI accelerator modules, devkits, and benchmarks

Rockchip unveiled two RK182X LLM/VLM accelerators at its developer conference last July, namely the RK1820 with 2.5GB RAM for ...

EurekAlert!

Beyond bigger models: How efficient multimodal AI is redefining the future of intelligence

Multimodal large language models have shown powerful abilities to understand and reason across text and images, but their ...

16d

Bolmo’s architecture unlocks efficient byte‑level LM training without sacrificing quality

Ai2 releases Bolmo, a new byte-level language model the company hopes would encourage more enterprises to use byte level architecture.

IEEE

Knowledge Probing on Decoder-Only Models in Medical Domain

Language models (LLMs) have become an integral part of numerous applications in the biomedical domain, leveraging their ability to process and generate human-like text [1]. In this study, we focus on ...

GitHub

AI-Trader: Can AI Beat the Market?

TradeTrap: A security-focused toolkit to evaluate and harden LLM-based trading agents, featuring prompt injection and MCP hijacking attack modules for resilience testing. RockAlpha: The investment ...

IEEE

STCA-LLM: Spatial–Temporal Cross-Attention Large Language Model for Wind Speed Forecasting

Abstract: Accurately forecasting wind speed is crucial for efficiently utilizing the renewable energy, stabilizing the energy system and advancing the progress of the decarbonization of our society.

Microsoft

LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation - Microsoft Research

CLIP is one of the most important multimodal foundational models today, aligning visual and textual signals into a shared feature space using a simple contrastive learning loss on large-scale ...

VentureBeat

Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Chinese AI startup Zhipu AI aka Z.ai has released its GLM-4.6V series, a new generation of open-source vision-language models (VLMs) optimized for multimodal reasoning, frontend automation, and ...

GitHub

[Bug] Slow on gfx1150 with both Vulkan and ROCm builds

./bin/sd --diffusion-model /data/comfyui/models/diffusion_models/z_image_turbo_bf16.safetensors --vae /data/comfyui/models/vae/ae.safetensors --llm /data/comfyui ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results