RC-CoSA: Controllable secure alignment architecture for large language models based on risk-constrained inference search

Meng, Linghao

doi:10.65102/is20261283

Research article

Ingegneria Sismica

Volume 43 Issue 3
Pages: 1
-20

RC-CoSA: Controllable secure alignment architecture for large language models based on risk-constrained inference search

Author(s): ^¹

¹School of Electrical Automation and Information Engineering, Tianjin University, Tianjin, China, 300072

Published: 10/06/2026

Cite

Meng, Linghao . “RC-CoSA: Controllable secure alignment architecture for large language models based on risk-constrained inference search.” Ingegneria Sismica Volume 43 Issue 3: 1-20, doi:10.65102/is20261283.

https://doi.org/10.65102/is20261283

Abstract

The current secure alignment of large language models (LLMs) generally adopts a static paradigm, that is, training a single model through predefined general principles. However, this approach lacks flexibility in the face of diverse security needs in different cultural backgrounds, geographical norms, and specific application scenarios. At the same time, re-aligning models for each segment requirement will bring high computing costs and engineering overhead. To this end, we propose a risk-constrained controllable safety alignment architecture (RC-CoSA), which aims to adapt the model to diverse and intertwined safety requirements in the inference stage without updating the underlying model parameters. Compared with existing methods that rely on single-sample autoregressive generation, RC-CoSA improves the robustness and controllability of response generation under complex security configurations through compliance-first best-of-N candidate screening, structured security completion for partial-compliance scenarios, and decoupling multi-stage reasoning-evaluation process. The experimental results show that the actual benefits of RC-CoSA have a certain base dependence: on the DeepSeek base, the proposed method significantly reduces the Helpful + Unsafe ratio from 11.0% to 0.5%, and increases the CoSA-Score to 0.596, and improves the overall information validity. On the GPT-4o base, RC-CoSAlign also increased the CoSA-Score from 0.288 to 0.349 and the Helpful + Safe from 50.8% to 61.9%, but its compression of the risk of violations is relatively limited. On the Llama3.1-8B-INST base, although the inference period enhancement can improve the comprehensive control performance, its inhibition stability against the risk of violation is still affected by the characteristics of the base model. The above results show that RC-CoSA, as an inference-period execution control framework, can effectively improve the comprehensive controllability of the model under complex security configurations, but its benefit intensity is still affected by the original security boundary, generation distribution and instruction compliance ability of the base model.

Keywords
Large Language Models, Risk-Constrained Controllable Safety Alignment, Best-of-N Optimization, Inference-Time Adaptation

Research article
https://doi.org/10.65102/is20261302

Visual analysis of related hotspots affecting Diab...

Volume 43 Issue 3
Pages: 1
-18
08/07/2026

^¹, ^¹

¹Guangzhou University of Chinese Medicine, School of Pharmaceutical Medicine, Guangzhou,Guangdong,China,510006

Research article
https://doi.org/10.65102/is20261300

Research on high-quality image super-resolution re...

Volume 43 Issue 3
Pages: 1
-21
08/07/2026

^¹,², ^¹,², ^¹

¹Hainan Vocational University of Science and Technology, Haikou 571126, China

²Institute for Mathematical Research, Universiti Putra Malaysia, Serdang 43400, Malaysia

Research article
https://doi.org/10.65102/is20261301

Multi-scale Dual Transformer based Multi long-term...

Volume 43 Issue 3
Pages: 1
-18
08/07/2026

^¹,², ^¹,², ^¹

¹Hainan Vocational University of Science and Technology, Haikou 571126, China

²Institute for Mathematical Research, Universiti Putra Malaysia, Serdang 43400, Malaysia

Research article
https://doi.org/10.65102/is20261299

Ultra-Short-Term Wind Power Forecasting Based on V...

Volume 43 Issue 3
Pages: 1
-15
08/07/2026

^¹, ^², ^¹, ^¹, ^¹

¹Electric Power Research Institute, State Grid Shanxi Electric Power Co., Ltd., Taiyuan, 030001, Shanxi, China

²Jincheng Power Supply Branch, State Grid Shanxi Electric Power Co., Ltd., Jincheng, 048000, Shanxi, China

Research article
https://doi.org/10.65102/is20261298

Integration of Traditional Culture Elements and Co...

Volume 43 Issue 3
Pages: 1
-12
01/07/2026

^¹,²

¹China Academy of Cultural Heritage, Chaoyang District, 100029, Beijing, China

²Beijing University of Civil Engineering and Architecture, Xicheng District, 100044, Beijing, China

Outline

Ingegneria Sismica

RC-CoSA: Controllable secure alignment architecture for large language models based on risk-constrained inference search

Abstract

Related Articles

Visual analysis of related hotspots affecting Diab...

Research on high-quality image super-resolution re...

Multi-scale Dual Transformer based Multi long-term...

Ultra-Short-Term Wind Power Forecasting Based on V...

Integration of Traditional Culture Elements and Co...

Open Access