Daniel in SD
(supervised)
90% success. I don’t remember if that meant 90% without interventions or if there was a more strict criteria. I don’t think it ever achieved 90% without interventions with V11. I lost a few beers.What's the metric?
You can install our site as a web app on your iOS device by utilizing the Add to Home Screen feature in Safari. Please see this thread for more details on this.
Note: This feature may not be available in some browsers.
90% success. I don’t remember if that meant 90% without interventions or if there was a more strict criteria. I don’t think it ever achieved 90% without interventions with V11. I lost a few beers.What's the metric?
Practically, FSD Beta hasn't really been used that much on city streets, so most of the recent miles come from highway driving with 11.x and some increases with wider release population. I would guess FSD Beta is around 15M miles/month off highways, so across a fleet of ~450k, that's about 1 mile a day per vehicle. So the near-term effect of V12 will likely be minimal on these safety numbers unless it's so comfortable that it drives up usage on secondary streets.The V12 work is focused on addressing driving on secondary streets, where accident rates roughly triple. I imagine the miles per accident figure will drop significantly unless V12 has solid accident avoidance capabilities.
That was the effect that I was focusing on. If folks start using FSD on secondary roads, the numbers should start to fall as they put their cars in harm's way.So the near-term effect of V12 will likely be minimal on these safety numbers unless it's so comfortable that it drives up usage on secondary streets.
Detailed extensively in 10.69 thread.What's the metric?
Same metric as before.90% success. I don’t remember if that meant 90% without interventions or if there was a more strict criteria. I don’t think it ever achieved 90% without interventions with V11. I lost a few beers.
Unfortunately, FSD training hasn't been TSLA's strong suit.
90% success (March of Nines!) not 100% success though, right?Detailed extensively in 10.69 thread.
Same metric as before.
No interventions by the driver or other drivers. (Covers creeping at the wrong time or too far, cutting it too close, not accelerating up to speed and forcing traffic to go around or slow down, etc.). Any honking from other drivers is disqualifying.
Missing large easy opportunities (more than five-second gaps).
So basically just normal driving. No mistakes, no weirdness, no interventions.
Using the median is weirdness and very silly, but that’s allowed.
Anyway no change from prior rules in 10.69 thread. That covers minimum number of attempts too.
We can only say that the Youtubers are testing here. Speaking for Tesla is pure speculation.The V12 work is focused on addressing driving on secondary streets
Probably stuck at a 4-way stop or maybe a rotaryThere was chatter that V12 was supposed to start going actually public with a slow roll starting this weekend. Been pretty quiet so far.
There was? Only things I heard was:There was chatter that V12 was supposed to start going actually public with a slow roll starting this weekend. Been pretty quiet so far.
Wishful thinking, perhaps.There was? Only things I heard was:
1. From Elon at earnings call: Starting to roll out in a few weeks.
2. From Omar: Thinks Tesla said they might be opening it up to a few additional public testers this weekend.
Where did you hear going public starting this weekend? Unless you're referring to (2) above.
Is that a Pabst Blue Ribbon, Schlitz, or Miller Highlife?Ah yes, our wager. In hindsight it was obvious that traditional programming would never be able to negotiate the arc of Chuck’s UPL. There’s just no way to calculate the future positions of all the vehicles with C code!
I will bet a beer that Chuck’s first version of FSD V12 will finally achieve 90% performance. V12 will be the start of the March of Nines!
Well, if they’d do a better job of sucking up maybe they’d get earlier access!Wishful thinking, perhaps.
The OG youtuber crowd must be livid that Omar is getting all the V12 views.
What do you mean by that?
To date, the only output we've seen from Tesla's video data training efforts is the birds-eye-view representation of the world. And perception has always been the strongest piece of FSD.
It comes tonight at 1 am your local time !!There was chatter that V12 was supposed to start going actually public with a slow roll starting this weekend. Been pretty quiet so far.
Yep 90%. This has never been accomplished with any prior FSD version.90% success (March of Nines!) not 100% success though, right?
I can't believe it's been a year and a half since 10.69!
(Elon apparently didn't watch the video. LOL)
I'm looking forward to watching 1000 successful turns when they get the third nine. I wonder what Chuck's manual driving success rate is?Yep 90%.
It is an honor to do this again and I do hope I finally have to pay for a beer.
There were some caveats about how much traffic as well (seems silly to give it a pass with extremely light traffic where it is just sailing through every time). Easiest thing would be to just do the results of all attempts on the first version, since even if Chuck does a light traffic video he will follow up with one in heavier traffic (usually). Usually he does not do 10 attempts in one video (our minimum number IIRC) but want to avoid the chance of a trivial 10 attempts being successful, and somehow labeling that a success.
Anyway once v12 easily accomplishes this hurdle, we can take bets on the second 9.
I wonder what Chuck's manual driving success rate is?
Chuck has said he's not aware of any collision at the UPL. Of course our wager isn't the same as collision rate. You could get one 9 for collisions just YOLOing it like this:He's probably done it hundreds of times (including of course his attempts while supervising FSD which also count).
Would be nice to know how many accidents there have been there. Low hundreds of attempts a day, probably, meaning a little less than 100k attempts per year.
Would 1 accident out of 1 million (unsupervised) attempts on this specific turn be good enough? I'm not sure how I feel about that. I guess it depends on the failure mode.
True! This is an impossible problem for traditional programming. Complex physics calculations like that have never been implemented in code. The NNs will just know and sense when it is the right time to go, based on millions of video clips. Much better than having a concrete, quantifiable idea of how much margin is required. Just trust the AI, it's flawless; we can see that now with the generative AI revolution.In hindsight it was obvious that traditional programming would never be able to negotiate the arc of Chuck’s UPL. There’s just no way to calculate the future positions of all the vehicles with C code!
Of course our wager isn't the same as collision rate
Eva looked both ways. Totally in control.just YOLOing it like this: