The GPT-4o Shift: What PMs Should Pay Attention To
OpenAI released GPT-4o this month, and the most significant change is not the speed improvement. It is the multimodal capability. The model natively handles text, images, and audio in a single interaction. For project managers, this opens up workflows that were not practical before.
Multimodal Documentation
I have already started experimenting with feeding GPT-4o screenshots of whiteboard sessions and asking it to extract action items and decisions. The results are surprisingly good. A photo of a messy whiteboard from a design session becomes a structured summary in thirty seconds.
This matters for distributed teams. When a co-located group has a whiteboard session, the remote members usually get a blurry photo and a brief Slack message. Now I can process that whiteboard into a proper document and share it within minutes.
Voice Interaction Changes the Context
The real-time voice capability is interesting for a different reason. It lowers the barrier to using AI during meetings. Instead of typing a prompt after the meeting, you could theoretically interact with the model during a meeting to pull up data or check against previous decisions. We are not there yet in practice, but the direction is clear.
What Has Not Changed
The fundamentals have not changed. AI still cannot replace judgment, relationship management, or accountability. GPT-4o is faster and more capable, but it is still a tool that needs a skilled operator.
I also maintain the same caution about data governance. Just because the model can process your sprint board screenshot does not mean you should feed it screenshots containing client names, financial data, or proprietary information. Check your organization's AI usage policy first.
The Practical Takeaway
If you are already using GPT-4 in your PM workflow, upgrade and test the multimodal features. If you have not started yet, this is a good entry point. The voice and vision capabilities make it more accessible than ever. Start small, stay cautious with sensitive data, and iterate.
←Back to all posts