Abstract: There is growing interest in ensuring that large language models (LLMs) align with human values. However, the alignment of such models is vulnerable to adversarial jailbreaks, which coax ...
We are keeping improving the documents and adding more implementation details. Please stay tuned at README-DEV.md for more information. 🦊WhiteFox is the first white-box compiler fuzzer using LLMs ...
Large language models (LLMs) show excellent performance but are compute- and memory-intensive. Quantization can reduce memory and accelerate inference. However, for LLMs beyond 100 billion parameters, ...