arXiv:2605.23926v1 Announce Type: new Abstract: Reasoning-capable large language models solve hard problems by emitting long chains of thought, paying heavily in latency, GPU time, and energy. Casual …
cyberintel.kalymoon.com · 31523 articles · updated every 4 hours · grows forever
arXiv:2605.23926v1 Announce Type: new Abstract: Reasoning-capable large language models solve hard problems by emitting long chains of thought, paying heavily in latency, GPU time, and energy. Casual …
arXiv:2605.23909v1 Announce Type: new Abstract: We investigate the calibration of large language models' (LLMs') confidence across diverse tasks. The results of our preregistered study show that the c…
arXiv:2605.23908v1 Announce Type: new Abstract: We are in the midst of large-scale industrial and academic efforts to automate the processes of scientific, technological and creative production throug…
arXiv:2605.25002v1 Announce Type: new Abstract: Memory-backed agents need provenance that can survive leaked or migrated snapshots, where logs, visible outputs, and trusted metadata may be absent. We …
arXiv:2605.24951v1 Announce Type: new Abstract: The advent of digital technologies has revolutionized traditional power distribution networks, transforming them into smart grids that are more reliable…
arXiv:2605.24949v1 Announce Type: new Abstract: Penetration testing is essential to securing modern web infrastructures, yet traditional manual methods struggle to keep pace with their scale and compl…
arXiv:2605.24941v1 Announce Type: new Abstract: Modern LLM agents combine long-term memory for personalization with tool-calling interfaces for taking actions in the world -- a combination underpinnin…
arXiv:2605.24903v1 Announce Type: new Abstract: Machine learning based malware detectors become obsolete over time due to concept drift in benign and malware applications. Recent methods rely on fully…
arXiv:2605.24834v1 Announce Type: new Abstract: Large language model (LLM) safety classifiers such as Llama Guard are effective at detecting overtly harmful prompts but remain vulnerable to adversaria…
arXiv:2605.24817v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) architectures have become an increasingly important paradigm for scaling Large Language Models (LLMs). As MoE models are increa…
arXiv:2605.24765v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly applied to cybersecurity question answering (QA) for critical tasks such as incident response and vulnerab…
arXiv:2605.24696v1 Announce Type: new Abstract: Streaming network intrusion detection systems must process flows continuously while keeping memory bounded, but most current methods leave alerting thre…
arXiv:2605.24663v1 Announce Type: new Abstract: This paper presents CyBOKClaw, an interpretable human-in-the-loop retrieval framework for mapping cybersecurity keywords or phrases (KWoPs) to the Cyber…
arXiv:2605.24632v1 Announce Type: new Abstract: Recent demonstrations of large language models producing candidate and confirmed vulnerabilities in production software have renewed the narrative that …
arXiv:2605.24559v1 Announce Type: new Abstract: Ransomware has grown to become one of the most damaging types of cybercrime, affecting private and public organizations in any sector. While early types…
arXiv:2605.24552v1 Announce Type: new Abstract: Representation engineering (RepE) defenses have shown strong robustness against jailbreak attacks on large language models (LLMs). However, these method…
arXiv:2605.24551v1 Announce Type: new Abstract: Cybersecurity awareness training has historically adopted a one-size-fits-all approach, despite established individual differences in how users process …
arXiv:2605.24542v1 Announce Type: new Abstract: This paper examines the erosion of Public Key Cryptography (PKC) security under adaptive adversarial optimisation driven by artificial intelligence. The…
arXiv:2605.24535v1 Announce Type: new Abstract: Jailbreak prompts can trigger harmful completions on aligned LLMs, In accordance, safety steering has been proposed: test-time activation interventions …
arXiv:2605.24421v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used as analyst assistants in security operations centers (SOCs), where they ingest log and alert data to …
arXiv:2605.24312v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) has become central to large language model (LLM) deployments, grounding responses in enterprise or proprietary data…
arXiv:2605.24309v1 Announce Type: new Abstract: We argue that LLM agent security is fundamentally an agent-human interaction (AHI) problem, not a purely algorithmic one. To substantiate this position,…
arXiv:2605.24300v1 Announce Type: new Abstract: Large language models (LLMs) are widely used for code generation, but their security reliability remains inconsistent across languages and prompting str…
arXiv:2605.24298v1 Announce Type: new Abstract: The growing use of Large Language Models (LLMs) for automated code generation has enhanced software development efficiency, but often at the cost of sec…