Umjetna inteligencijadb#2650

OpenSeeker razbija monopol: AI pretraživač s 11.700 podataka pobjeđuje Alibabu

(16h ago)
Shanghai, China
the-decoder.com
OpenSeeker razbija monopol: AI pretraživač s 11.700 podataka pobjeđuje Alibabu

OpenSeeker razbija monopol: AI pretraživač s 11.700 podataka pobjeđuje Alibabu📷 © Tech&Space

  • 11.700 podataka dovoljno za vrhunske rezultate
  • OpenSeeker nadmašuje Alibabin Tongyi DeepResearch
  • Otvoreni kod i podaci mijenjaju pravila igre

Istraživači sa Šangajskog Jiao Tong sveučilišta objavili su OpenSeeker, AI pretraživač koji s samo 11.700 točaka za obuku postiže rezultate usporedive s Alibabinim Tongyi DeepResearchom – a sve je otvoreno i dostupno pod MIT licencom. Riječ je o rijetkom primjeru gdje minimalni podaci i jednostruka obuka ne znače kompromis na performansama.

OpenSeeker postiže 48,4% na kineskom BrowseComp-ZH benchmarku, nadmašujući Alibabu za 1,7 postotnih bodova, dok na engleskom OpenAI-ovom BrowseCompu bilježi 29,5% – gotovo dvostruko više od konkurentskog DeepDivea. Ono što čini OpenSeeker zanimljivim nije samo brojčana nadmoć, već način na koji je postignuta.

Umjesto da se oslanja na ogromne skupove podataka, tim je koristio stvarne strukture web poveznica za generiranje pitanja i odgovora, prisiljavajući model na višestepeno pretraživanje i zaključivanje. Istraživači su objavili sve – od podataka do modela, što je rijetkost u industriji gdje su čak i akademski projekti često poluzatvoreni.

No, koliko je ova otvorenost zapravo održiva u svijetu gdje su podaci nova nafta? OpenSeekerov uspjeh sugerira da kvaliteta podataka može nadoknaditi kvantitetu, ali koliko je to primjenjivo izvan kontroliranih uvjeta?

Benchmark brojevi zvuče impresivno, ali tko zapravo dobiva prednost?

Benchmark brojevi zvuče impresivno, ali tko zapravo dobiva prednost?📷 © Tech&Space

Benchmark brojevi zvuče impresivno, ali tko zapravo dobiva prednost?

Benchmark rezultati, iako impresivni, uvijek dolaze s upozorenjem: laboratorijski testovi rijetko se poklapaju s realnim scenarijima. OpenSeekerov uspjeh na BrowseComp-ZH i BrowseCompu sugerira da kvaliteta podataka može nadoknaditi kvantitetu, ali koliko je to primjenjivo izvan kontroliranih uvjeta?

Alibaba i drugi veliki igrači već godinama grade zatvorene ekosustave, gdje su podaci i modeli ključna konkurentska prednost. Pravi test za OpenSeeker bit će kako će se ponašati u divljini – na stvarnim upitima, s šumom i nepredvidivim korisničkim ponašanjem.

Iako otvoreni pristup privlači akademsku zajednicu i male igrače, velika tehnološka poduzeća teško će odustati od svojih zatvorenih sustava bez borbe. GitHub aktivnost i reakcije na forumima poput Hugging Facea bit će ključni pokazatelj koliko je projekt zapravo relevantan izvan medijskih naslova.

Ono što je sigurno: OpenSeeker je signal da monopol na AI podatke nije nepobjediv. Ali hoće li to biti dovoljno da se promijene pravila igre, ili će ostati akademski eksperiment?

Uspjeh OpenSeekera na benchmarkovima je impresivan, ali njegova održivost u svijetu gdje su podaci nova nafta je još uvijek nepoznana. Ako će OpenSeeker uspjeti u divljini, ovisi će o njegovoj sposobnosti da se prilagodi realnim scenarijima i korisničkim potrebama. Svojim otvorenim pristupom, OpenSeeker može postati važan dio akademskih i istraživačkih zajednica.

OpenSeeker benchmark vs. Alibaba AI searchAI search engine performance comparison (11,700 datasets)Enterprise AI search competitionMultilingual search benchmarkingOpen-source AI search alternatives

//Comments

AIAmazon’s $50B OpenAI bet: Trainium’s real test begins nowSpaceMapping the Local Bubble’s magnetic field reshapes cosmic scienceAIGoogle’s Gemini games flop: AI hype hits gamer realitySpaceStarship’s Tenth Test: The Reusability Threshold CrossedAINvidia’s AI tax: half your salary or half your careerSpaceJWST peels back dust to reveal star birth in W51AITriangle Health’s $4M AI won’t replace your doctor—yetSpaceAI’s Copyright Chaos Threatens Space Exploration DataAIHumble AI is just healthcare’s latest buzzword for ‘don’t trust us yet’SpaceExoplanet spins confirm a planetary mass ruleAIOpenAI’s teen safety tools: open source or open question?GamingCrimson Desert’s AI art fail: a mockup that slipped throughAITinder’s AI gambit: swiping left on endless swipingGamingPearl Abyss hid AI assets in Crimson Desert—now players want answersAINVIDIA’s Alpamayo AI: Self-Driving’s Hardest Problem or Just Another Demo?GamingCapcom Rejects AI AssetsAIWaymo’s police problem exposes AV’s real-world blind spotsRoboticsAtlas Redefines Humanoid DesignAILittlebird’s $11M bet: AI that reads your screen—without the screenshotsRoboticsOne antenna, two worlds: robot sniffs out realityAIUK firms drown in AI hype, emerge with empty spreadsheetsRoboticsDrone swarms take flight—but not off the demo lot yetAIApple’s Gemini Distillation: On-Device AI Without the Cloud HypeTechnologyTaiwan’s chip giants bet on helium and nukes to dodge supply shocksAICapcom’s AI partner talk is just corporate speak for ‘we’ll use it carefully’MedicineTelmisartan Boosts Cancer TreatmentAIOpenSeeker’s open gambit: Can 11K data points break AI’s data monopoly?MedicineXaira Unveils X-CellAIGimlet Labs Solves AI BottleneckMedicineAI Fails to Speed Lung Cancer DiagnosisAIHelion Powers OpenAIAINVIDIA’s OpenShell: Security for AI Agents or Just Another Hype Shell?AIDRAFT Boosts AI SafetyAIProject Glasswing: AI finds flaws everywhere—except in its own hypeAIPAM: Complex Math for a 10% Performance HitAIOpenAI’s erotic chatbot pause exposes AI’s adult content dilemmaAIAI Ranks Recovery Factors—but Who’s Really Listening?AIDeepMind’s AI safety play: real guardrails or just another demo?AIAmazon’s $50B OpenAI bet: Trainium’s real test begins nowSpaceMapping the Local Bubble’s magnetic field reshapes cosmic scienceAIGoogle’s Gemini games flop: AI hype hits gamer realitySpaceStarship’s Tenth Test: The Reusability Threshold CrossedAINvidia’s AI tax: half your salary or half your careerSpaceJWST peels back dust to reveal star birth in W51AITriangle Health’s $4M AI won’t replace your doctor—yetSpaceAI’s Copyright Chaos Threatens Space Exploration DataAIHumble AI is just healthcare’s latest buzzword for ‘don’t trust us yet’SpaceExoplanet spins confirm a planetary mass ruleAIOpenAI’s teen safety tools: open source or open question?GamingCrimson Desert’s AI art fail: a mockup that slipped throughAITinder’s AI gambit: swiping left on endless swipingGamingPearl Abyss hid AI assets in Crimson Desert—now players want answersAINVIDIA’s Alpamayo AI: Self-Driving’s Hardest Problem or Just Another Demo?GamingCapcom Rejects AI AssetsAIWaymo’s police problem exposes AV’s real-world blind spotsRoboticsAtlas Redefines Humanoid DesignAILittlebird’s $11M bet: AI that reads your screen—without the screenshotsRoboticsOne antenna, two worlds: robot sniffs out realityAIUK firms drown in AI hype, emerge with empty spreadsheetsRoboticsDrone swarms take flight—but not off the demo lot yetAIApple’s Gemini Distillation: On-Device AI Without the Cloud HypeTechnologyTaiwan’s chip giants bet on helium and nukes to dodge supply shocksAICapcom’s AI partner talk is just corporate speak for ‘we’ll use it carefully’MedicineTelmisartan Boosts Cancer TreatmentAIOpenSeeker’s open gambit: Can 11K data points break AI’s data monopoly?MedicineXaira Unveils X-CellAIGimlet Labs Solves AI BottleneckMedicineAI Fails to Speed Lung Cancer DiagnosisAIHelion Powers OpenAIAINVIDIA’s OpenShell: Security for AI Agents or Just Another Hype Shell?AIDRAFT Boosts AI SafetyAIProject Glasswing: AI finds flaws everywhere—except in its own hypeAIPAM: Complex Math for a 10% Performance HitAIOpenAI’s erotic chatbot pause exposes AI’s adult content dilemmaAIAI Ranks Recovery Factors—but Who’s Really Listening?AIDeepMind’s AI safety play: real guardrails or just another demo?
⊞ Foto Review