About DeepPolicy

Redefining Optimization Controlfor Process Industrywith Industrial Agents

DeepPolicy originated from Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences. With engineering-oriented reinforcement learning at its core, DeepPolicy provides autonomous optimization industrial agent systems for process industry, energy, semiconductor, and refrigeration sectors — advancing industrial control from reactive response to proactive optimization.

Proprietary RL Algorithms

Enterprise Partners

Plant-wide Efficiency Gain

High-level Publications

Research Origin

Originated from Shenzhen Instituteof Advanced Technology, CAS

The team has spent years in reinforcement learning, optimal control, and industrial system modeling, undertaking national key research programs and publishing 40+ papers at top venues including NeurIPS, ICML, and AAAI — establishing a reliable engineering pathway from frontier algorithms to real industrial processes.

Reinforcement LearningOptimal ControlIndustrial System ModelingSafe ExplorationOffline Policy EvaluationImitation LearningMulti-agentNeurIPS / ICML / AAAI

Research Pillars

Algorithm Research

Continuous investment in safety constraints, sample efficiency, and offline training for engineering-deployable RL.

Platform Engineering

Distilling laboratory algorithms into deliverable platform foundations: ReinforceOS / ReinforceLab / ReinforceBox.

Field Practice

Deep engagement in petrochemical, energy, and data center frontlines — feeding real operating conditions back into research for closed-loop iteration.

Milestones

From Research to Industry

Key milestones at each stage, witnessing how algorithms progressively land in real industrial environments.

Phase 01

2022

Technology Accumulation

Deep research in reinforcement learning algorithms, successfully overcoming challenges in environmental disturbance and observation information trends. Published multiple papers in top international journals and obtained patent authorizations in China, the US, Europe, and Japan.

Phase 02

2023

Technology Validation

Industrial agent took initial shape — core technology completed real-world validation at a factory in Tokyo, Japan, earning industry recognition. Concurrently completed collaboration with Huawei on refrigeration system energy-saving.

Phase 03

2024

Seeking Partnerships

Company officially established and operational, with the overall industrial agent architecture substantially refined. Broadly seeking compatible scenarios and partners across process industries while accelerating technology implementation and commercialization.

Phase 04

2025

Industry Validation

Industrial agent achieved scenario-specific application validation. Actual deployments began in petrochemical and material chemistry sectors for quality and efficiency gains; deployments completed in refrigeration systems and data centers to reduce cooling energy consumption.

Phase 05

2026

Product Platform Launch

Officially launched ReinforceOS, ReinforceLab, and ReinforceBox — three major products forming the complete 'Platform — Validation — Terminal' closed loop.

Ecosystem Partners

Validated Side-by-Side with Industry Leaders

We have established long-term partnerships with leading DCS suppliers and process industry enterprises in China, completing algorithm validation and scaled deployment on real equipment.

DCS Partner

DCS Control System Partners

Deep collaboration with mainstream DCS systems, delivering policies to process plants.

Process Industry

Process Industry Leaders

Algorithm validation completed on real operating conditions at petrochemical and chemical plants.

Data Center

Large Data Center Operators

Cooling group control / PUE optimization / energy management validated in production.

Research Ally

Research Institutions & Universities

Leveraging CAS research resources to continuously advance algorithm frontier research.

Deep partnerships established with 10+ leading enterprises, continuously expanding.

Core Capabilities

What We Do

Building a complete capability stack across algorithm, platform, and terminal layers — making policies trainable, verifiable, deployable, and maintainable.

Industrial Agents

RL-based autonomous decision agents that continuously seek optima under constraints, replacing manual tuning and rule-based systems.

Optimization Control Platform

An integrated engineering platform for data ingestion, environment modeling, policy training, and online evaluation for industrial processes.

Data Validation & Replay

Algorithm verification, policy replay, and return estimation on offline data — reducing field trial-and-error costs.

Edge Control Terminal

Industrial-grade AI control box supporting DCS/PLC integration for low-latency closed-loop control on-site.

Edge-Cloud Collaboration

Cloud training / edge deployment / bidirectional sync — policies can be remotely upgraded and field-adaptive.

Energy Efficiency & Stability

Continuous optimization around PUE, unit consumption, and stability metrics to deliver measurable business returns.

Safety & Compliance

Constrained RL and safe exploration mechanisms to strictly protect process boundaries and production safety.

Cross-industry Reuse

Templates covering petrochemical, power, water, semiconductor, and data center sectors for rapid scenario migration.

Industry Scenes

The Real Environments We Serve

Across petrochemical, material chemistry, paper manufacturing, and refrigeration systems, each real-world site feeds operating conditions back into the algorithms, continuously refining the engineering boundaries of RL.

Petrochemical01

Petrochemical

Industrial agents aggregate tens of millions of data points from plant zones, pipelines, and logistics in real time, dynamically optimizing reaction temperature, pressure, and feedstock ratios while ensuring safety interlocks, improving product quality consistency, reducing energy consumption and carbon emissions, and providing early warnings for leaks and corrosion.

Learn More

Material Chemistry02

Material Chemistry

Industrial agents analyze multi-variable coupling across batching, reaction, separation, purification, drying, and calcination processes to auto-optimize formulations and process parameters, reducing batch quality variance and shortening the cycle from pilot to mass production.

Learn More

Paper Industry03

Paper Industry

Industrial agents coordinate pulping, stock preparation, drying, and reeling processes — moving beyond fixed expert rules to adjust consistency, machine speed, and steam usage in real time, stabilizing paper basis weight and moisture while reducing cost waste and dryer energy consumption.

Learn More

Refrigeration04

Refrigeration Systems

Industrial agents combine cooling load prediction with environmental changes to autonomously control compressor start/stop, frequency, condensing pressure setpoints, and air outlet volume, keeping refrigeration efficiency on the optimal COP curve, maintaining stable space temperatures, and delivering anomaly diagnostic recommendations.

Learn More

Mission & Vision

Bringing Advanced Algorithms into Industrial Reality

We believe the deciding factor in next-generation industrial control is not more sensors or bigger databases — it's "self-optimizing strategy."

Mission

Make Industrial Control Self-evolving

With RL at the core, deliver verifiable, continuously-evolving autonomous optimization agents to process industry, energy, semiconductor, and refrigeration sectors — helping real production shift from "reactive response" to "proactive optimization."

Vision

Become the Global Foundation for Industrial Intelligence

Build a full-chain product system covering platform, validation, and terminal — enabling every critical plant to run safer, more efficiently, and with lower carbon footprint through continuous algorithmic collaboration.

Start a Verifiable Partnership

If your enterprise is looking for a deployable industrial intelligence optimization path, we welcome you to get in touch.

Send a Message

Research Center Shenzhen

Building F, 3F, 1068 Xueyuan Ave, Xili, Nanshan District, Shenzhen

SIAT, Chinese Academy of Sciences · Materials AI Research Center

R&D Center Beijing

Building 9, 4F, Room 406-2185, No.10 Automotive Museum West Rd, Fengtai, Beijing

Beijing DeepPolicy Technology Co., Ltd.

Business Center Ma'anshan

Ma'anshan Software Park E1, Huashan District, Ma'anshan, Anhui

Ma'anshan DeepPolicy Technology Co., Ltd.