Baturay Saglam

Orcid: 0000-0002-8324-5980

According to our database1, Baturay Saglam authored at least 33 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Self-Improving In-Context Learning.
CoRR, May, 2026

Test-Time Safety Alignment.
CoRR, April, 2026

Test-Time Detoxification without Training or Learning Anything.
CoRR, February, 2026

Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report.
CoRR, January, 2026

2025
Think Before You Retrieve: Learning Test-Time Adaptive Search with Small Language Models.
CoRR, November, 2025

Risk-Averse Constrained Reinforcement Learning with Optimized Certainty Equivalents.
CoRR, October, 2025

Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report.
CoRR, August, 2025

Large Language Models Encode Semantics in Low-Dimensional Linear Subspaces.
CoRR, July, 2025

Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report.
CoRR, April, 2025

Learning Task Representations from In-Context Learning.
CoRR, February, 2025

Large Language Models Encode Semantics and Alignment in Linearly Separable Representations.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

Learning Task Representations from In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Parameter-Free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients.
Neural Process. Lett., April, 2024

Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach.
Trans. Mach. Learn. Res., 2024

Compatible Gradient Approximations for Actor-Critic Algorithms.
CoRR, 2024

Actor Prioritized Experience Replay (Abstract Reprint).
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Deep intrinsically motivated exploration in continuous control.
Mach. Learn., December, 2023

Actor Prioritized Experience Replay.
J. Artif. Intell. Res., 2023

Denoising Diffusion Adversarial Models for Unconditional Medical Image Generation.
Proceedings of the 31st Signal Processing and Communications Applications Conference, 2023

User Feedback-based Online Learning for Intent Classification.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-Aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI.
Proceedings of the IEEE International Conference on Communications, 2023

2022
Off-Policy Correction for Actor-Critic Algorithms in Deep Reinforcement Learning.
CoRR, 2022

Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms.
CoRR, 2022

An Optimistic Approach to the Temporal Difference Error in Off-Policy Actor-Critic Algorithms.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2022

An Intrinsic Motivation Based Artificial Goal Generation in On-Policy Continuous Control.
Proceedings of the 30th Signal Processing and Communications Applications Conference, 2022

Unified Intrinsically Motivated Exploration for Off-Policy Learning in Continuous Action Spaces.
Proceedings of the 30th Signal Processing and Communications Applications Conference, 2022

Improving the Performance of Batch-Constrained Reinforcement Learning in Continuous Action Domains via Generative Adversarial Networks.
Proceedings of the 30th Signal Processing and Communications Applications Conference, 2022

Face Inpainting with Pre-trained Image Transformers.
Proceedings of the 30th Signal Processing and Communications Applications Conference, 2022

Bottleneck Sharing Generative Adversarial Networks for Unified Multi-Contrast MR Image Synthesis.
Proceedings of the 30th Signal Processing and Communications Applications Conference, 2022

2021
Parameter-Free Deterministic Reduction of the Estimation Bias in Continuous Control.
CoRR, 2021

Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

AWD3: Dynamic Reduction of the Estimation Bias.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021


  Loading...