I completely ignored Anthropic’s advice and wrote a more elaborate test prompt based on a use case I’m familiar with and therefore can audit the agent’s code quality. In 2021, I wrote a script to scrape YouTube video metadata from videos on a given channel using YouTube’s Data API, but the API is poorly and counterintuitively documented and my Python scripts aren’t great. I subscribe to the SiIvagunner YouTube account which, as a part of the channel’s gimmick (musical swaps with different melodies than the ones expected), posts hundreds of videos per month with nondescript thumbnails and titles, making it nonobvious which videos are the best other than the view counts. The video metadata could be used to surface good videos I missed, so I had a fun idea to test Opus 4.5:
She was so good in fact that she was soon promoted to commander, in another first.
。关于这个话题,夫子提供了深入分析
The game server runs at 10 “ticks” a second. Every tick we move and grow players, eat fruit, calculate collisions, and broadcast the new gamestate to clients.
По данным 59.ru, ракетную опасность в Пермском крае ввели в 14:40 по местному времени. Отмечается, что все оперативные службы находятся в полной готовности.
A year later, Kennedy's dream was posthumously seen to fruition. A small step was taken and mankind took its giant leap. The New Nine had done their job.