Can robots powered by Artificial Intelligence (AI) accelerate outreach and impact of capacity building activities aimed at social development?
The International Training Centre of the International Labour Organization (ITCILO) provides a safe space for experimentation with innovative capacity development solutions that harness emerging technologies to promote social justice through decent work, including AI powered robots to boost learning and collaboration. This digital brief captures selected findings about action research on the near future feasibility of deploying these RobotGPTs.
Recent technological breakthroughs in AI have given the robots’ development new impetus. In a nutshell, thanks to Large Language Models like ChatGPT, the real-time speech interaction between robots and humans during learning and collaboration activities has moved from science fiction to near future feasibility – opening the door to a new paradigm in in education.
The term RobotGPTs refers to robotic systems whose high-level intelligence is driven by Large Language Models (LLMs). These systems combine multiple layers:
Collaborative robots or short Cobots are robots designed to work alongside humans. Cobots are typically predictable, sociable, adaptable, safe, and intuitive. They are already selectively used in manufacturing, warehousing, healthcare, hospitality, and research. Robot GPTs extend Cobots by adding conversational intelligence and high-level planning. Cobots are furthermore typically action-specific; designed to perform a certain set of functions only, and therefore generally not humanoid.
Two proxy indicators to track the dynamics of developments in a given technology domain are investments and patents. Fact is that large technology companies and well-funded start-ups are investing significantly in general-purpose humanoid robots and that patent applications have fast increased over the last ten years. Most recently, the number of patent applications has come down but presumably not due to a wane in interest but because technology starts maturing, with a shift of focus towards product quality and commercialization rather than patent quantity.
Potential uses of robot GPTs in adult learning include:
Advanced robots can be categorized by mobility, morphology, and autonomy:
LLMs are the ‘cognitive core’ that makes these robots generative and adaptive. LLMs are advanced neural networks trained on massive textual data sets to predict the next word and generate fitting prose. Their training equips them with a form of semantic knowledge and the ability to reason in language, and when integrated with robotics they can interpret natural-language instructions and formulate high-level plans. However, LLMs on their own do not have a built-in understanding of the physical world: they have no visual of objects or a sensation of their weight, and they have not experienced spatial constraints. In purely text-based form, an LLM cannot know what commands as basic as “grasp the bottle” mean in terms of joint trajectories or tactile feedback. Attempts to connect a chatbot to a robot can yield plausible but ungrounded commands, resulting in unrealistic or unsafe plans. To make language models useful for embodied agents, they must be coupled with sensory data, control policies, and planning frameworks.
The following approaches illustrate different attempts to ground language in perception and action:
ITCILO puts strong emphasis on the ‘action’ in its action research activities, with the explicit objective of feeding the findings into product pilots and sandbox experiments. For this reason, the action research on the future of learning with RobotGPTs emphasized on the near future up to 2030, which can be considered the first horizon for significant experimentation and pilot deployment. The research focused accordingly on assessing selected collaborative robot applications using LLMs that are either already available as prototypes or close to deployment. These applications were assessed through the lens of an adult learning service provider outside the formal education system, and vis-à-vis nine criteria: (ease of) integration into learning service cycles, cost, flexibility, collaboration potential, learning impact, maturity, ethical issues, environmental footprint, and equity.
Multi-criteria evaluation of RobotGPT applications (qualitative estimates)
Several common patterns and considerations emerge when comparing the competing systems mal across these evaluation criteria:
Taken together, these trends generally suggest that while the technology is advancing quickly in terms of task flexibility and human-machine interaction, significant gaps remain in affordability, maturity, governance, and sustainability.
RobotGPTs are an intriguing yet immature class of technologies. By combining large‑language‑model cognition with advanced robotics, they offer natural language interfaces, planning and reasoning, and embodied interaction. Prototypes such as Gemini Robotics, Helix, Phoenix, and Optimus demonstrate rapid advances in dexterity and collaboration. However, costs remain prohibitive, and most models are still in early stages of development. Safety, reliability, and ethical concerns further limit their readiness for widespread deployment. Yet the long‑term potential is significant: as AI and robotics continue to converge, costs will fall and capabilities will grow, creating opportunities to enhance adult learning through demonstration, simulation, translation, and facilitation.
1. Focus on complementarity. RobotGPTs should complement, not replace, human trainers. Use cases could include demonstration of complex research tasks, translation, and real-time feedback. Prioritize applications that address accessibility and inclusion, such as language interpretation or regulatory clarification.
2. Human-oriented by design. Commit to human‑centred design that involves stakeholders in co‑creation, mapping social contexts, measuring impacts, and managing risks.
3. Promote interoperable ecosystems. Encouraging the use of open‑source software and standard interfaces can avoid vendor lock‑in.
1. Launch RobotGPT Foresight Labs: Partner with universities and technology hubs to create immersive foresight studios combining VR, digital twins, and LLM‑enabled robots. These labs would let stakeholders “time‑travel” into possible futures and design policies that anticipate technological breakthroughs before they hit the mainstream.
2. AI‑Powered Peer Mentoring: Pair learners with “robot mentors” programmed using open LLMs and local labour laws. These robots would facilitate peer learning, moderate group simulations, and help participants practice complex negotiation or safety scenarios.
3. Campus Pilot of a Test Humanoid RobotGPT: ITCILO could pilot a humanoid RobotGPT on campus as a live prototype to assist with training activities, particularly in language support, research assistance and teamwork facilitation. The robot would integrate a multilingual large language model with vision and sensor data—similar to systems like PaLM‑E, enabling it to translate between languages, answer participant queries by retrieving and summarizing relevant materials, and moderate group exercises. By evaluating how learners and trainers interact with the prototype in real training scenarios, ITCILO can generate evidence‑based insights on the potential and limitations of RobotGPTs
For the long read of the action research findings and a list of reference documents