Forums

Topic: The Nintendo Switch Thread

Posts 53,541 to 53,560 of 69,716

Grumblevolcano

@JaxonH I was thinking more about how the CPU was the big limiting factor with PS4/XB1, it was bad even for 2013.

Grumblevolcano

JaxonH

@Grumblevolcano
Ya, CPU would be upgraded, I imagine, and be a marked improvement over the PS4/X1 CPU. But even if it wasn’t, just having native power on par with 8th gen base consoles is a massive leap forward, old CPU or not. But assuming it did, and it had DLSS on top of it, the end result would be games that potentially look and run closer to PS4 Pro or Xbox One X.

If Nintendo doesn’t work with Nvidia to utilize Tensor cores on their next revision or successor, they’re hopelessly idiotic. But to even be able to utilize DLSS you need a certain base level of power. I don’t know where that line is drawn, but I do know it’s more than the current Switch power level. Which might explain why they’re making the system more powerful to begin with (assuming it’s true, of course). Maybe they needed that level of power in order to be able to leverage the DLSS, and so it was kind of like, a lot of bang for buck. Making the system that powerful not only gives you more power, but it also enables you to then get a further 1.5X - 2X increase with DLSS, so the bang for buck of the increase was much more appealing.

But who knows. Worst case, we wait til 2024-2025 for a new generation model. Either way we’re going to get a more powerful Switch in the next four years, And I would be downright amazed if they didn’t at least make the jump to 8th gen power level for their next gen system.

Psalms 22:16 (1,000 yrs before Christ)
They pierced My hands and feet
Isaiah 53:5 (700 yrs before Christ)
He was pierced for our transgressions

Switch Friend Code: SW-1947-6504-9005

Dezzy

JaxonH wrote:

The challenge would come in integrating DLSS into a mobile chip, which to my knowledge, has never been done before. Not to say it can’t be done- it just hasn’t been. Not yet anyways. But they haven’t had a reason to. Now they do. Switch being that reason.

I assume that part of it would be located in the dock? DLSS is just a better version of the MClassic really isn't it? In that it just takes in a video signal and works essentially as a post-processing shader.

Cos that can definitely be separated from the base console hardware.

It's dangerous to go alone! Stay at home.

Haruki_NLI

@Dezzy @JaxonH Remember last year Nvidia had job listings for an engineer, with one task listed being integrating AI upscaling techniques into mobile chipsets, such as Tegra.

So ya know, we all KNOW Nvidia is going to make it happen.

Now Playing: Mario & Luigi Brothership, Sonic x Shadow Generations

Now Streaming: The Legend of Zelda: Echoes of Wisdom

NLI Discord: https://bit.ly/2IoFIvj

Twitch: https://bit.ly/2wcA7E4

JaxonH

@Haruki_NLI
Huh. Ya, I do remember that now.

It's an interesting piece of the puzzle, for sure. And does seem to line up with the hypothesis.

Psalms 22:16 (1,000 yrs before Christ)
They pierced My hands and feet
Isaiah 53:5 (700 yrs before Christ)
He was pierced for our transgressions

Switch Friend Code: SW-1947-6504-9005

JaxonH

@Dezzy
It's nothing like mClassic.

It's deep learning super sampling. They feed a server low res images and ultra-high res images and it "learns" how to upscale the lower res images to look like the ultra-high res images. They do this with millions upon millions of images and train it. Each game has to be "trained" into the system separately.

Once complete, DLSS uses special AI accelerators called Tensor cores to process and upscale the rendered image in real time. But it does it before the signal is ever output. It has to be built in on the chip level.

Part of this is because in order to upscale the next frame it has to anticipate what the next frame will be, and it determines this by motion vectors embedded in the data.

Thus far, Tensor Cores are only in Nvidia RTX 20xx and 30xx GPUs (as far as gaming GPUs- I imagine they may be used in other product lines for automobiles). My RTX 2080 ti has them and can do raytracing and DLSS 2.0. The new GPUs (RTX 3070, 3080 and 3090, for example) can also do it. But if a mobile chip were to integrate Tensor Cores, it could do it too. But it's not something that can be done externally. It would require a new chip design with integrated Tensor Cores for the machine learning.

[Edited by JaxonH]

Psalms 22:16 (1,000 yrs before Christ)
They pierced My hands and feet
Isaiah 53:5 (700 yrs before Christ)
He was pierced for our transgressions

Switch Friend Code: SW-1947-6504-9005

Dezzy

@JaxonH

Right, but all of that is still post-processing though, so I'm not sure why it couldn't be separated in principle. Maybe it hasn't in practice so far, but that's a different issue.

It's dangerous to go alone! Stay at home.

JaxonH

@Dezzy
It’s not “post processing” though (at least, not in the terms you’re speaking of- like mClassic taking an already rendered 2D image and just sprucing it up- traditional post processing must happen before the signal is output, unlike mCable).

It’s part of the rendering “process”- I think that’s a better way to put it. Another way to put it is to clarify the difference between external post processing and internal post processing. mClassic is external. It’s just taking a 2D image and trying to use anti aliasing to clean up the jagged edge lines. Traditional post processing, and DLSS especially, are done internally, before the image is sent out as a 2D signal. It’s why the motion vectors are needed so that it can predict the next frame and render that upscaled frame with the tensor cores predicting what it should look like. It’s why tensor cores are needed, built into the die of the chip itself.

Basically what I’m saying is, it can’t happen in the dock.

[Edited by JaxonH]

Psalms 22:16 (1,000 yrs before Christ)
They pierced My hands and feet
Isaiah 53:5 (700 yrs before Christ)
He was pierced for our transgressions

Switch Friend Code: SW-1947-6504-9005

Dezzy

@JaxonH

How is it not? Motion vectors are in screen-space as well (at least every implementation I'm aware of, which is mostly Unreal and Unity), so that definitely counts as post processing. Every part of the algorithm that I'm aware of (which I only have a layman's understanding of, so maybe I'm missing something), happens in screen-space. That counts as post-processing by my lights.

It's dangerous to go alone! Stay at home.

JaxonH

@Dezzy
Again, read my above post.

There is a difference between external post processing an internal post processing. mClassic externally upscales a 2D image and cleans up jagged lines in a 2D image. Traditional post processing happens before the signal is output. It’s work done by the processor itself (and this explains why mClassic AA is far worse than traditional post processing).

DLSS requires Tensor cores built into the chip, and it must happen before the signal is output.

https://www.techspot.com/article/2049-what-are-tensor-cores/

I’m not up to speed on the cutting edge of external, auxiliary processing via USB-C connection- all I know is that the only GPUs that can do DLSS need Tensor Cores built in. Maybe that’ll change one day, but I’m not currently aware of anything that can do that as of right now.

[Edited by JaxonH]

Psalms 22:16 (1,000 yrs before Christ)
They pierced My hands and feet
Isaiah 53:5 (700 yrs before Christ)
He was pierced for our transgressions

Switch Friend Code: SW-1947-6504-9005

Dezzy

@JaxonH

"upscales a 2D image and cleans up jagged lines in a 2D image" is exactly the same thing traditional post processing would be doing though, so I'm not sure what the principled distinction is.

I've read that article, and the only thing I could find that actually speaks to the question of why it needs to be built into the GPU, is that they plan to use tensor cores for other things as well, like physics calculations. That certainly would require it being part of the GPU, because it would need to be done in world-space.

It's dangerous to go alone! Stay at home.

JaxonH

@Dezzy
I think that’s where the misunderstanding is coming in.

Cleaning up a 2D image is not what traditional post processing does. It does it before it’s ever creates the 2-D image signal. There is a very distinct difference between mclassic and traditional post processing..

Per Eurogamer:

So with all the setup done, what does the cable actually do? We've heard claims that the effect is similar to SMAA setting, with Marseille using a proprietary technique it calls contextual anti-aliasing (CXAA). The comparisons do have merit, even if it's not precisely the same. SMAA can be integrated into the rendering pipeline, whereas the mCable only gets the final, flat, 2D output of the console to look at. The mCable can't provide a deeper, engine-level understanding of the game, or an intelligent use of previous frames similar to temporal anti-aliasing. What you have instead is all done in post

The Tensor cores are required for the Tensor calculations, which GPUs are very good at. It takes a whole lot of CPU cores to do the same matrix multiplication operations a GPU can do. It also takes a lot of power and so they pack in dedicated cores specifically devoted to that.

I mean, I’m not sure what this is even about, or why you’re so insistent DLSS can be done from the dock without Tensor Cores? Even if it could be done, let’s just say, it’s not something that’s likely going to be done, so why even bother discussing it? There’s no clear advantage.

It’s like, trying to reinvent the wheel when the wheel’s already been invented. Use a chip with integrated tensor cores and voilà, you’ve got DLSS capabilities, with no drawbacks, for both docked and handheld. Do you think they’re going to gather their engineers and try to figure out a way to send a signal to the dock, deconstruct the image (if that’s even possible, I don’t know), let a built in dock GPU run all the calculations, recompile the image and send it back out (hopefully avoiding any lag in the process) which still leaves handheld mode lacking any benefit, rather than just... using a chip with Tensor cores? To what end?

I’m all for exploring different possibilities, but, there’s no evidence such a thing could even exist, and even if it could exist, there’s no evidence that it provides any benefit, and even if it could exist and could provide benefit, there’s no evidence it’s worth investing the time or money to figure out how, and there’s still the glaring issue of handheld not gaining any benefit, so any game being played that requires that power, when you undock the system, would be unplayable, crash the system, or worse. So... the whole conversation kinda seems moot.

I’m not saying an auxiliary processing unit in the dock is ruled out, but I think it’s fair to say if they take that particular approach, DLSS is almost certainly off the table. And if such an approach was ever to be utilized, it would all but ensure the additional power doesn’t get taken advantage with games that couldn’t otherwise run on the system. For the same reason Nintendo didn’t let us use external hard drives (undocking would crash the game) I don’t think they would be in favor of auxiliary power. Unless it was negligible to the point it couldn’t make new games possible on the system, and only enhanced playability of existing games.

[Edited by JaxonH]

Psalms 22:16 (1,000 yrs before Christ)
They pierced My hands and feet
Isaiah 53:5 (700 yrs before Christ)
He was pierced for our transgressions

Switch Friend Code: SW-1947-6504-9005

skywake

@JaxonH @Dezzy
I see no reason why they couldn't in theory add post processing to the dock. The problem is for it to be worthwhile and better than the kind of upscaling your TV could do it would increase the cost of the dock significantly. To the point where they'd be losing money on the standalone dock which is significant because I'm fairly confident there's a significant markup on the standalone dock.

But for less than that they could release a new SKU with more horsepower in general, decent DLSS upscaling built in and include a higher HDMI spec dock. Which would give performance advantages both docked and undocked AND give the Switch a bit more momentum at some point next year.

Some playlists: Top All Time Songs, Top Last Year
An opinion is only respectable if it can be defended. Respect people, not opinions

Dezzy

@JaxonH

I see your misunderstanding and I raise you!

The confusion is that you're conflating anti-aliasing specifically with post-processing in general. There are like 20 different mainstream anti-aliasing approaches. Some of them work as regular post-processing shaders (e.g FXAA, NFAA), but others don't, or are just automated by the GPU hardware (as MSAA usually is). There's even one that tries to run on the CPU, which sounds kind of mental to me, but it exists all the same.

So when I said MClassic works the same as a regular post-processing shader, I assumed we meant if it was using the same algorithm. FXAA implemented by an MClassic would be exactly the same as FXAA done by the GPU, so that's something you could separate into a different piece of hardware with no problems.

The general point I was trying to get at, was that it might be easier and cheaper for them to develop if they could separate that DLSS function into the dock, because then they would have to worry less about how to get the necessary power source and cooling system into a handheld device (that they want to fit into someone's pocket)

It's dangerous to go alone! Stay at home.

JaxonH

@skywake
Post processing, sure. I’ve never disputed the fact they could, in theory, use auxiliary processing in the dock. And certainly it could be better than upscaling your TV does.

But DLSS specifically? That’s something else entirely. DLSS is not just fancy upscaling. It’s a taxing process utilizing cores devoted to advanced machine learning prediction within the rendering pipeline. If it was that easy then we would see auxiliary devices offered to make DLSS viable on PCs that don’t have RTX 20xx and 30xx GPUs with Tensor Cores (the vast majority of the PC market).

That said, there is no real reason to think they’d go for auxiliary power in the dock regardless. Even if we weren’t hearing about the DLSS (which pretty much writes it off entirely, since you need a chip with Tensor Cores), the marginal benefit you get from additional processing in the dock could simply be done with a more powerful chip that benefits both docked and handheld mode.

The only way we see auxiliary dock processing is if it’s negligible increase. Otherwise why bother. If you’re going to make it a large increase then you want to get bang for your buck for that investment. Such as games running that couldn’t previously run. And if you do that when it’s only possible in the dock, well, the problem is self evident. You undock, and the game crashes.

So if DLSS is true, it’s a new chip using Volta or Ampere with Tensor Cores. Open and shut. If DLSS is not true, I still don’t think they’re going to do auxiliary power in the dock for the reasons stated above (although I won’t go so far as discounting it entirely). Why not just make a more powerful chip that benefits both modes of use, which is basically what you’re saying.

[Edited by JaxonH]

Psalms 22:16 (1,000 yrs before Christ)
They pierced My hands and feet
Isaiah 53:5 (700 yrs before Christ)
He was pierced for our transgressions

Switch Friend Code: SW-1947-6504-9005

Dezzy

skywake wrote:

I see no reason why they couldn't in theory add post processing to the dock. The problem is for it to be worthwhile and better than the kind of upscaling your TV could do it would increase the cost of the dock significantly. To the point where they'd be losing money on the standalone dock which is significant because I'm fairly confident there's a significant markup on the standalone dock.

Right. My assumption was that the cost of the dock would just be substracted from the cost of the console though, given that the equivalent tech would need to be added to the GPU otherwise. Probably not true in practice I realize, but in principle $A + $B= $(A+B)

It's dangerous to go alone! Stay at home.

JaxonH

[Edited by JaxonH]

Psalms 22:16 (1,000 yrs before Christ)
They pierced My hands and feet
Isaiah 53:5 (700 yrs before Christ)
He was pierced for our transgressions

Switch Friend Code: SW-1947-6504-9005

Dezzy

JaxonH wrote:

@Dezzy
then I see no reason to believe they could figure out how to do it for something infinitely more complex that requires it’s own Tensor Cores on the chip, and would ultimately introduce a fatal design flaw of games not working when undocked.

Well I assume they're not gonna have that function working when it's undocked either way. I assume the power draw will just be too high to run off a battery. Plus you don't really need any higher than 720p on a handheld screen. So either way, I'm assuming they're gonna be developing a version where the tensor cores only activate when docked.

It's dangerous to go alone! Stay at home.

Giancarlothomaz

@Magician we need to remember/consider that the Legend of Zelda franchise is notorious for the delays until the final release of it games, the significant impact of the pandemic in the development of Zelda Breath of the Wild 2 and the potencial release of the remaster of Wind Waker e Twilight Princess on Switch.

i like HD Rumble.

X:

JaxonH

@Dezzy
I don’t think that’s the case. DLSS would make it less demanding to run in handheld, not more.

Handheld games could render at 1080p for the cost of rendering 540p or 720p. That’s the whole benefit. It lowers the power required for the same performance, or increases the performance for the same power drawn.

So ya, DLSS in handheld would mean getting 1080p on every single game across the board, with battery life equivalent to rendering 540p or 720p.

[Edited by JaxonH]

Psalms 22:16 (1,000 yrs before Christ)
They pierced My hands and feet
Isaiah 53:5 (700 yrs before Christ)
He was pierced for our transgressions

Switch Friend Code: SW-1947-6504-9005

Please login or sign up to reply to this topic