๐๐น๐ฎ๐ฐ๐ธ๐ฅ๐ผ๐ฐ๐ธ ๐ฅ๐ฒ๐๐ฒ๐ฎ๐ฟ๐ฐ๐ต๐ฒ๐ฟ๐ ๐๐ฒ๐๐ฒ๐น๐ผ๐ฝ ๐๐ ๐๐ด๐ฒ๐ป๐ ๐ฆ๐๐๐๐ฒ๐บ ๐ณ๐ผ๐ฟ ๐ฆ๐๐ผ๐ฐ๐ธ ๐ฃ๐ถ๐ฐ๐ธ๐
Instead of relying on one frontier model, BlackRock built three AI โagentsโ that mimic different analyst roles:
โข Fundamental Agent โ parses 10-Ks and earnings reports
โข Sentiment Agent โ reviews news and analyst ratings
โข Valuation Agent โ studies prices, volatility, and volumes
Each agent analyzes a stock independently, then enters a round-robin debate. Disagreements are argued until the agents reach consensus on whether to BUY or SELL โ a process designed to mimic an investment committee.
The system runs on Microsoft's AutoGen framework using GPT-4o, with custom tools for each agent: document parsing for 10-Ks, news summarization, and volatility calculators.
The agents' recommendations change based on risk tolerance settings. The same volatile stock might get a SELL from a risk-averse agent but a BUY from a risk-neutral one analyzing identical data.
Tested on 15 tech stocks over four months in 2024, the system outperformed both single agents and the benchmark in risk-neutral portfolios on a risk-adjusted basis (Sharpe ratios).
In risk-averse portfolios, all approaches lagged the benchmark โ since volatile tech names were excluded โ but the multi-agent showed smaller drawdowns than single agents.
The authors argue this setup improves analytical rigor and helps mitigate behavioral biases like overconfidence. While limited in scope and not a full portfolio optimizer, the study suggests specialized, debating agents may prove more reliable than general models for quantitative finance.
Takeaway: LLMs can be a Swiss Army knife in daily life, but for mathematical analysis, specialized agents may be the sharper tool.
ai-street.co/i/18358206โฆ