Monday, February 27, 2023

Smart-Compression

Similar Wavelet Conversion with minimal reprocessing : Smart Access : RS

(repeated encoding cost reduction) i know you are a coder, you could help ffmpeg & avx on the FX8320E, Likewise consoles face same issue with FFMPEG & Codecs & likewise with media acceleration by non repetition of encoding

Similar Wavelet Conversion with minimal reprocessing : Smart Access : RS

Printing Technology 'When you "Tie" the Knot' : 
We want those Hand drawn Donald duck, Micky & Daffy in true line drawn splendour, 
But hand drawing 8K is hell, 
Remaster printing technology : For all monitors, TV's & Operating systems : DTS, Dolby : Functioning wave conversion

Smart-De-Compression : repeated encoding cost reduction : (c)Rupert S


Wavelet Classifiers

Audio
Video
Compressed Data, GZip, BZip, LZH

Primarily our goal is to Originate Encode in a form that is Compatable with the hardware chain,

For example in the case of HDD > CPU > GPU the right Texture & Number formats, Often 16Bit or 32Bit float & Texture,

However with Video we have to expand the frame wavelets into Compatable Texture formats!

We convert the Video Wavelet in Smart Access to the closest Texture format wavelet; Or directly play the video! But suppose we are using Bink Video? We directly convert & keep wavelets that are the same in the new texture,

We therefore select a texture format like NV12 or ETC2; One that has the most Similar Wavelets & can therefore reduce Conversion Cost of the frame by as much as 100% (If all wavelets are the same)!

We know Wavelet types & Colour depth of all texture classes; So we will select one with a good range,
In most cases we play MP4+ Wavelets; So we can Use a JPG type texture; So all the compression wavelets remain minimally processed.

A single Frame + previous B Frame; Into a single texture of the same Wavelet Compression Classification,

The result is minimal processing CPU Cycles.

*

Overall reducing costs of higher resolution resolving; As available in 264 > 265 > 266/VVC & other Media Encoders : Rupert S


You can see that, formats such as 265 & 264 are related, Obviously at a higher resolution in the case of 265!
But in many Wavelet transform cases we can minimise the Processing cost, We do however need to know like Google's ML Voice Encoder; The ones we do not need to change (minimum benefaction)

My chief challenge of Wavelet thought is a multiple frame picture of an eye (WebP for example),
The resolution is 640x480 & We know in most probabilities that; The Eye was transformed to wavelet in HD,

So we have a wavelet curve; Black centre & A surrounding Iris!
We need to expand that wavelet so we will suppose that the higher precision version of the wavelet will add details?

We must explore how the wavelet transforms a Higher Resolution form into a lower resolution form,
We can therefore in theory use the same wavelet at higher resolving depth?

We might be able to convert a lower resolving wavelet in 12Bit into the 16Bit version & have a better understanding of the higher quality version!

We can therefore most probably reuse the wavelet; Transforming from 264 to 265 & upscale & compress more,

Overall reducing costs of higher resolution resolving; As available in 264 > 265 > 266/VVC

*
#WaveletProve Both that the wavelet is infinite & that; The Breton
shirt wavelet has a pattern represented in 12Bit but liberating into
the profound on 16Bit, 32Bit & more!

(To understand wavelet context, in textile & theory & of course Audio & Video)

Can we prove the wavelet of a Breton shirt for infinity, like mauri
My augment being that we can upscale that Breton shirt! & prove it's
17th century values...
Both that the wavelet is infinite & that; The Breton shirt wavelet has
a pattern represented in 12Bit but liberating into the profound on
16Bit, 32Bit & more!

Example Wavelets to prove upscaling is possible https://is.gd/WaveletData
*

Rupert S
*

Wavelet Upscaling : JPG / Video / Games

Example 2 Voxel to High Quality : RS


The Story : HP : V-FX Wavelet Voxel Transforms : V-FX-WVT (c)RS (Harry Potter + More)

I was wondering what to add to Wavelet transforms; Well i was thinking about Harry Potter,
Full body FX are Half Resolution; In Fact they are Depth of Field Voxels,

For people who don't know Voxel is when you make a Cube of the right shade from a picture & set it at the right depth!

For those criticizing such an act as lazy; You would have to understand how fast technology has developed!

Some characters Fly at a very low resolution & Others like Harry Potter & Melfoy Don't!

You would have to realise that V-FX is based on the ability of the person to be in the role... They perform ;-)

*

V-FX Wavelet Voxel Transforms : V-FX-WVT (c)RS (Harry Potter + More)


*
Definitions

The Wavelet is the JPG Pixel Group of a single Group of pixels at the same size as the composing Voxels of the V-FX

A Voxel is a Cube of Pixels set in 3D
*

When it comes to Transforms; This piece is called:

Transforms for classic movies : How you upscale VFX : RS

Firstly the VOXEL (Simple Wavelet Cube) needs to be compared to a fully dressed original character,

Then you need to map the correct features into The voxel cube space; After you Average Anti-Alias & Upscale the Cube Map (Original V-FX + Original Video Frame Person)

You then need to map an effective Wavelet of the Original V-FX with a modifier Layer of transparent Wavelet (The Photo in High Detail, This is also a Wavelet Series)

(c)RS

*

Example 3 : Lessons to learn : Wavelets : Upscaling (c)RS


Now about the Voxel 4x4 cube map 'Transform wavelet' is a simple JPG Wavelet
(if used properly compressed & older games did not because processors where not very fast (33Mhz)

High resolution 'Transform Wavelet' (Overlayed) is a full to higher resolution JPG Wavelet
In Upscaling we need to get from one to the other,
Transform Wavelet from Voxel Wavelet,

Sample Scaling:But supposing we have samples of like minded objects?
We can use Machine Learning to imprint a pattern!

But great looking as this is, not perfect as seen in Example 3 About Example 2 : HP!

Wavelet permutation:

Resolve the wavelet to full precision, Workable; But we need to know the result is correct!ML Can help; But that is very subjective..

Mostly this works.

Identity Follow through:

Machine Learning that identifies the subject matter [Samsung & LG TV's 2020+ Example]

So what do we do? We Add the lot! haha

Rupert S

*

Example 4 : Lessons to learn : Wavelets : Upscaling (c)RS

2 Pattern Matrix Wavelet (c)RS


Wavelets are patterns; With Colour infilling (why not a wavelet itself!

Well wavelets come in forms (Gif)8Bit, 10Bit, 12Bit, 16Bit(JPG)

We can advance the precision by using a higher Precision (16Bit, 24Bit, 32Bit); But we need to save storage space!

First thing is to use bF16 & bF32; This keeps the majority of the data from being sub pixels.

Second thing is to make maximum use of multiple Precisions, Mix F16 with F32..
Google Lyra Codec demonstrates this in Machine Learning.

Third : Keep Precision within margins, Small Textures do well in 8Bit Matrix Wavelets...
But 16Bit Colour Precision & 16Bit Precision both look good in HD High Quality HDR WCG

(Usable as encryption archetype): Chaos:A:B:T:Pi:Arc:Sin:Tan
Very usable /dev/rnd Random Ring : TRNG : GPU : CPU : Asics : Using Chaos Wavelet

{Wavelet:Colour Point) A to B as expression of Pi
{Wavelet:Colour Point} A to B as expression of Arc, Sin, Tan

[2PMW File Array]
[Header : Easy Identifier : Basic Name]
{Header Packed Wavelet Groups] [1 Image Wavelet : Colour Shading Wavelet 2, 4, 8 Group]

[Image Array lines]
|Packed Groups of] : [ Image Wavelet 1 : Colour Shading Wavelet Associations, 1 to 8]
[Packed Groups of] : [ Image Wavelet 1 : Colour Shading Wavelet Associations, 1 to 8]
[Packed Groups of] : [ Image Wavelet 1 : Colour Shading Wavelet Associations, 1 to 8]

[PG],[PG],[PG],[PG],[PG]
[PG],[PG],[PG],[PG],[PG]
[PG],[PG],[PG],[PG],[PG]
[PG],[PG],[PG],[PG],[PG]
[PG],[PG],[PG],[PG],[PG]

*

Audio/Video/Image Format : Packing Vectors (c)RS

Several choices of Interpolation; With low computation cost to higher Cycle Performance..
Depending on processor feature sets (such as NANO & SiMD & Crypto processor, Manageable in integer)

Vector Wavelet Examples : Math object

Wavelet Curve compress, Normally from left because we code Left to right & that is optimal for our hardware.
Can be numeric sequence Direction point 1=D D=1,2,3,4 2=Db = 1,2,3,4 | Displacement Dp = 1,2,3,4 Assuming Left To Right or curve displacement = Time

Distance N from source edge, Curve:Sin/Tan
(Example) D=1 Db=3 Dp1=2 Dp2=3 | Curve = Tan3+Db2

Logarithmic Pack,
Integer Comparator : N+N2+N3=N+1+2+3 | Sequence
*

Example 5 : Predict Scaling : SiMD/AVX.SSE3 : (c)RS


SiMD Interpolation grids & Predict with Raytracing & General SiMD
Reference Grid
https://science.n-helix.com/2023/03/path-trace.html
https://science.n-helix.com/2022/08/jit-dongle.html

With the Interception/Processing of Predict Statements in Frames of Video & Audio; Using a simple Grid:

Pr = Predict (motion) Px = Pixel t1:2:3 time period

PxPx1PxPxPx3
Pr1Pr2PxPx2Px
Px1PxPr3PxPx
Px1Pr2PxPxPx
Px1PxPr2PxPx

Basically you can see the pixels move in frame Px1 & Predicted in Pr2 & Pr3,
Raytracing SiMD predict future motion though maths; We can use the SiMD to,

Both predict & interpolate/Upscale from 8bit, 10Bit, 12Bit, 14Bit to 16Bit values or rather wavelets,
Because Raytracing SiMD are high precision maths; They prove advantageous if we have them; SiMD/AVX.SSE3

Interpolation : Prxi Pxri : {PxPrPi} Theory : RS


We must present a point between Px (pixel) & Pr (predict); In maths this would be a remainder,
We can draw a pixel in the Remainder Point; The Interpolation point (PI); When? When we upscale!,
We can use two principles, Px (actual pixel), Pr (Predicted Pixel), PI Pixel Interpolation!

We can guess with both Px & Pr on the content of PI & both Predict & Interpolate the pixel...
As additional Data; This does not worry us a lot.

PxPIPxPxPI
PIPxPrPIPx
PrPrPxPiPr

(c)Rupert S

*

The principle is 2 Stage interpolation with splines:


We measure points between 2 values; for examples:

Px to Px (Side by side comparison interpolation)

Px to Pr, Pr1, Pr2,Pr3 (motion & predicted content), Upward Time & Circumference Interpolation.

Px to Px2,Px3 (Time increasing potential interpolation, Both static content & calculated motion)

We calculate Pixels between 2 values.

Time content comes in 2 categories:

Predicted location & Content:Pr

Static location: Px (recorded location per frame)

Finally comes Pi: Calculated locations & Content between Pixels
*

*

Interpolation & Extrapolation Policy : RS


We can conclude Interpolation & Tessellation have requirements : 2D & 3D Spline Interpolation & Extrapolation; Gaussian methods on linear surfaces,

We extrapolate the new; Such as blade edge; We can however layout a simple grid to our supposition edge & interpolate.

We do not need to extrapolate where we have planed to draw; With so much as a 3cm polygon with 4 Lines & 2 edges,

We can however draw a fractal blade; For example : HellSinger from Elric Melbone.
*

https://sg.indeed.com/career-advice/career-development/interpolation-vs-extrapolation
Massive Datasets https://www.aimsciences.org/DCDS/article/2023/43/3&4

Python Libraries Interpolation:

15 Types
https://help.scilab.org/section_64fa3f01fdb19353faf0c6806a64a533.html

Gaussian
https://gmd.copernicus.org/articles/16/1697/2023/
https://gmd.copernicus.org/articles/16/1697/2023/gmd-16-1697-2023.pdf

SiMD Gaussian Blending & Dithering - Better_Fixed_Point_Filtering_with_Averaging_Trees
https://andrew.adams.pub/Better_Fixed_Point_Filtering_with_Averaging_Trees.pdf

Vectorization of Kernel and Image Subsampling in FIR Image Filtering
http://bncss.org/index.php/bncss/article/viewFile/101/105

Super temporal Resolution Imaging of Membrane Potential via Stroboscopic Microscopy
https://pubs.acs.org/doi/epdf/10.1021/cbmi.3c00054

Implementation of a High-Quality Dolby Digital Decoder Using SiMD MMX™ Technology
https://smtnet.com/library/files/upload/dolby-intel.pdf

JIT Compile Displacement Micromap : Interpolation & Extrapolation Policy : RS

Compress its internal geometry representations into the compressed format Just in time,
Optimizing, Allocating & de-allocating in accord with Mesh Shaders & Cache availability.

VK_NV_displacement_micromap, which for Vulkan ray-tracing can help with added detail
No Comment https://www.phoronix.com/news/Vulkan-1.3.245-Released
VK_NV_displacement_micromap allows a displacement micromap structure to be attached to the geometry of the acceleration structure,
allow the application to compress its internal geometry representations into the compressed format ahead of time.

*

Our options for interpolation (don't forget Gaussian)

bsplin3val — 3d spline arbitrary derivative evaluation function
cshep2d — bidimensional cubic shepard (scattered) interpolation
eval_cshep2d — bidimensional cubic shepard interpolation evaluation
interp — cubic spline evaluation function
interp1 — 1D interpolation in nearest, linear or spline mode
interp2d — bicubic spline (2d) evaluation function
interp3d — 3d spline evaluation function
interpln — linear interpolation
linear_interpn — n dimensional linear interpolation
lsq_splin — weighted least squares cubic spline fitting
mesh2d — Triangulation of n points in the plane
smooth — smoothing by spline functions
splin — cubic spline interpolation
splin2d — bicubic spline gridded 2d interpolation
splin3d — spline gridded 3d interpolation

*

2D-3D Spline Interpolations with background complementary colour layer smooth blend


Right on the kindle paper white 2D Spline is good for a single layer, 3D Spline is good if you rasterize a shader behind the text and shade it: The method would not cost over 1% of processing power on a 2 core ARM 400Mhz, If the image is relatively static.

On full Colour HDR WebBrowser, The 3D Spline method makes sense with complementary colour blending...
On mostly static content; 3% of total page processing costs.
On mostly Static Text with mobile images a combination of 2D & 3D Spline; 7% to 15% of cost.

interp2d — bicubic spline (2d) evaluation function
interp3d — 3d spline evaluation function

Rupert S

*

High Definition Fusions : HDF Technique:RS (use scaling references example 4+3+2)


I know that many of you, Use a machine learning based technique that enhances the sharpness & realism when upscaling,
The Voxel technique is very complementary to this view; Taking a 4 Pixel cube & transforming the look with additional details.

High Definition Fusions : HDF Technique:RS (use scaling references example 4+3+2)

I would call this technique High Definition Fusions : HDF,

4 times the size frame buffer with upscaled into the buffer..
The Second thread then loads additional high resolution samples into the buffer with a blend.

You have to observe the Details such as edges & X-OR mask the data inplace..

Merge the data with the High definition component first load & the real details loaded ontop & then Gaussian Sharpen blended & smoothed.

Ideally the sample data is from the original source in high resolution.
FSR & VSR can potentially work this way.

Rupert S

Font Scaling : RS


A really good example is downscaling a 300Pt Font into a raster image for the 8pt Version..
But we Cache a buffer with all our letters & Gaussian blend from 32Pt to 8pt,
For that we need to MipMap a 300pt Vector font.

300pt Font Cache
Rasterize at 300pt
Mipmaps : 300pt, 200pt, 180pt, 96pt, 60pt, 30pt
Gausian blend & cache at our size.

(our size is probably 96pt or 120pt by screen & 600pt & 300pt by printer)

That looks 100's of times better!

Rupert S

*

Content Adaptation Dimming Zone Technology


Remember Content Adaptation Dimming Zone Technology,
works for smaller frame buffers,

With many devices 4GB RAM or more than simply enhance 8Bit & 10Bit per channel to 16Bit Smoothing Anti-Aliasing,

Micro buffering allows much more,

Single Zone [SS][SubS] Buffers could run you into 24MB Thread Buffer with an over head initial Buffer of 256MB Write Back Cached Rewritable Main buffer

(So you can align Micro Contrast HDR WCG)

RS

*

Role : sSSubSampra Micro frame buffers a cube


The LED Brightness curve; The logarithmic voltage & WATT brightness & colour variance,
In computer chips this is basically the Response in light to the voltage & WATT input..

By controlling this upto 16Bit Dithered dynamic voltage control; usually with a modulated resister/ Transistor..
Such as a POT Potentiated differentiator; We can control the LED by altering the voltage & input WATT,

We can also input a fluctuating digital signal, A signal that we dither,

We take the signal of a SiMD or CPU & process this though a DAC or directly modulate the signal from the pin,
By analysing the output; We can produce the result of a digital waveform with reduced voltage..

Max voltage = 16Bit * 11111>111:16b , Per connection & Usually we would Access the array to DIMM Post the LED,

We can also micro array the LED Access with groups of LED & Cable.

However we need a method of pre calculating the Digital Dither to 16Bit,
But due to the RAM requirements we may be posting 10Bit from the frame buffer!

This is where sSSubSampra Micro frame buffers a cube of LED & Gaussian Dithers the colour palette & composes the group of pixels to the LED Electronics.

We can quick post from SiMD if they can post DSC codec compressed bytes to the DSC Processor or display LED,
All we need are the shapes from DSC to be available for direct posting : DIM Post to screen,

Passthrough & recompression can be optimised; Using the DMA & Codec Compression Shapes, Both to Upscale & to speed up the display,

For all we need is that DSC has shapes we can refine in SiMD; From 8Bit to 32Bit SiMD Post is potentially possible by directly DMA Writing RAM to output.

Speed differences are a few ns for a few more circuits.

RS

*

Full Screen Sync, Single Cycle Multithreaded with [SS:SubS]Method : sSSubSampra


Line post is traditional on CRT (because of single ray & analogue line by line TV Aerial signal.
Digital Age & we TV Aerial receive per frame digital compressed MP4 & H263/4/5/6 & VP9..
We still send per frame content as a line in effect,

However the Single post method requires a complete Compressed Frame; In HDR WCG 12Bit this requires considerable RAM for frame buffer...

You can output a frame; GIMP Uses 500MB of RAM for a single editable image,

With single/Multiple line DIM Post a buffer of at least 32MB would be required..
Post processing constitutes at best 1/2/3/4/8 lines & memory retention!

Full Screen Sync, Single Cycle MultiThread with [SS:SubS]Method

The outlined method is my sSSubSampra : Dimming Zone : RS Method

Because the Screen is divided into Frame Buffer Cubes : SS & Sub Cubes SubS,
We Buffer the frame & Cache & Post in Cubes [SS] with Sub Cubes [SubS]; These Cubes constitute smaller work units with smaller RAM Requirements.

4K Image HDR 16Bit x RGBA = 500MB RAM Uncompressed.
(3840 X 2160) Image / 64 = 129,600px or (60px by 33.75px) * 64

(60px by 33.75px) = * 8

[SS] = 480px by 270px
[SubS] = 60px by 33.75px

As we can see the DIM Post DMA Write is only 8MB to 16MB with full post processing multi-threaded.

Rupert S

*

sSSubSampra : Dimming Zone : RS

Technique for Dimming Zones on all LED class devices.

(MipMaps: As AMD has a great MipMap in FidelityFX!:
So what advantage in creating our own? Well, let's see!)

For a start our MipMap needs to be Higher than screen resolution!

So we need to Gaussian Sharpen to a larger frame buffer,
Then we need to Sub-Sample > Dimming Zones, So why ? So we can lighten & darken parts of the dimming zone!
We can shade [SS] Sample Zones & Sub Sample [SubS]

A screen usually needs a linear maximum & minimum light level; So we set these levels.
Divide SubS into a waveform filter with 3 to 8 levels of brightness

So what do we need? Read above!

Super Sample Frame buffer
[SS]
For [SS] = 4 [SubS] * N

Example Dimming Zone MipMap Zone
[SS][SS]
[SS][SS]

[SubS][SubS][SubS][SubS]
[SubS][SubS][SubS][SubS]
[SubS][SubS][SubS][SubS]
[SubS][SubS][SubS][SubS]

Rupert S

*

*Texture [SSSubN] : RS

sSSubTexture

[SS Texture with sub parts SubS]
N*[SubS Texture](Squares * N)
Refer to [SS/(N*SubS)]

Packed Layers for filtering

[6 * Same Size MipMap Sub Samples, Dark First with light layered ontop]
Very Light
Light
Lighter
Darker
Dark
Very Dark
*

We can treat each layer using ML, Logical Gaussian Filters & Sharpens & Colour Vividness & Clarity.

We DMA Move the frame by priority order, Dark first to very light.

DMA Move [Texture Block][ VL, L, Lr, Dr, D, VD] Very Dark Arrives first to paint; This gives us the advantage of only lightening the screen,
But we do need the entire block to be DMA Transferred in 1 to 3 Ticks; This has a flashing effect if we don't paint the order in a single frame; So we must.

Rupert S

*

MipMap Brightness Layer Example : sSSubTexture


The example of a code:

Fetch colour range (of the LED, For example Reds, Greens, Blues),
Grouped Colour range fetch saves on loads; But 16Bit SiMD can only load a small range; So a single colour,

If we have 16Bit per channel we only load one colour range per Pull,
So we perform 3; Red, Green, Blue; Or 4 Red, Green, Blue, Black..

When i say range i mean how light the pixel is; But we also blend the colour with the surrounding pixels subtly to anti alias,
To Anti Alias we need to bias colour reproduction to brightness closer to the next colour pixels,

High Dynamic contrast; Still link colour brightness so the pixels blend,
Higher contrast removes waveform similarity,

In lower contrast scenarios such as dark walls colours form in waves & therefore are smooth & able to be blended,
Lower contrast colour combinations lack distinct details & therefore are well compressed,

Sharp high contrast colours are edges & liable to be aliased; We therefore link the local pixels & subtly match colours,

For the example 7 shade MipMap we block groups of pixels into textures of different brightness; 7 levels,
We can blend & sharpen each level of brightness for optimal expression of vivid visual information.

RS

*

Content Adaptation Dimming Zone Technology can be on 2 fronts:

Display Signal Adaptive Content
The HDMI & DisplayPort signal can be Dynamically adjusted for Colour & Gamut range,
Example 8Bit/16Bit: RGB + Brightness & Darkness peak with on screen Profile & Gamma Curve

The Dimming Zone Technology can then adapt to Display Source DDC & Available ICC - Internet Consortium Colour Profile : HDR BT2020, BT2084, BT709

Directly on the display, In the firmware.

Personally I believe that with Both; We will get the best.

RS

By this technique, You are not obliged to have a Micro Dimming Array..
But obviously Quality of the screen will be higher with a Micro Dimming array!

The idea being that you can contrast and optimize all parts of a screen locally..
You will not need a Seperate Tile Micro Dimming Cable system.

You will significantly improve Micro Dimming with tiles to be honest & Improve it with sSSubSampra micro contrast & colour.

sSSubSampra significantly improves Multiprocessing of all image effects such as sharpening, Smoothing & filtering.

RS

*

Audio, Video & High precision Float ML


tensors & full onnx configuration : Upscaling : While we are not sure how much ML we need & at what precision,

We can be sure that 32Bit (per channel) Value RGBA (Multiple layer) requires at least 8Bit to 16Bit per channel final precision; So here is a list:

Required Value of output, Neural Network precision guide table: RS

Input
8Bit, 10Bit, 12Bit, 16Bit

Input network precision average bit retention (for RAM some error is allowed)
6Bit, 8Bit, 10Bit, 14Bit, 16Bit

Classifiers as we know can be,
Int 2Bit 4Bit, 8Bit, 16Bit, 32Bit
2 Bit is unlikely & 32Bit is for Dream Smooth 16Bit+ Precision output

Output Float (Mostly FP & F16b)
16Bit = { 8Bit, 10Bit, 12Bit }
24Bit, 32Bit, 64Bit = { 16Bit, 32Bit, 48Bit }
We can upscale : Audio, Video, Content & Polygons, We classify Quality by expectations & Quantify by percent %

Rupert S

*

Classifier Behaviour


F16 Compare Object Classifiers { Meta Data such as descriptors for the blind, Colour, Shape to Data }
F16:Int8 Compare Shape to table
Int8 Identify shape more subtly than Sharpen : Define Shape Sx

F16 Compare Database to X
F16 Int8 Compare edge alias to X
Int8 Define [Edge X & Compare] | Send to Edge Sharpen matrix

Set Shape to sharpen Elliptic
Sharpen or blur or Gaussian : Define Shape = Sh

Sharpen or blur or Gaussian or spline3d interpolation
*

Audio, Video & High precision Float ML : Colour palette example function


With the High precision Float ML method we are capable of offering our VESA configuration on a compatible colour profile,
sRGB, BT709, BT2020, BT2084 & widen the palette!

So why ? 2 reasons:

Gaussian blending & Bi Linear pixel blending; We require a very subtle palette to bled well,
But we do not have RAM & processors to burn!

Gaussian blending is efficient in the [SS][SubS] Pattern; Where we are dealing with patterns of Micro Dimming & adaptive contrast..

The smaller pattern & Brightness MipMap layers mean we can blend layers as we need,

Dark zones for example are noise hell; So we can Gaussian them; But we can sample details.
Brighter parts of the image are sure to have details that we need; But we handle each layer within the matrix..

Smaller RAM Loads & faster Writes, Better Caching per frame.

Rupert S

*

Quad pixel is part of the texture format.


As described in Example 2, 3, 4

The principle of how to work a Quad or Ten Pixels into a shape,
Easier to describe in texture format words; A shape is made in a SiMD to be sent to a group of pixels,

Grouping pixels means fewer DMA transfers; Because a SiMD is , 8bit, 16Bit, 32Bit, 64Bit..
Both the shape & the shade are described in a single request..

Alternatively the pixel is subject to higher precision colour (64Bit for example); Therefore we can smooth blend with subtle shading & colour,

We can also send 2 frames per send if we divide the SiMD into two lower precision parts..
But we have to receive the DMA as if we are interpreting 2 lower bit Integer/Floats; As of:

Integer floats with remainder theory :

https://science.n-helix.com/2021/02/multi-operation-maths.html

https://science.n-helix.com/2018/01/integer-floats-with-remainder-theory.html

Wavelet Formation, Write [Px2] from [Px1] overlap as required by motion:
Write round in sequence or Write [Px1] Centric Texture to [Px1]>[Px2]

[DMA]
[Px2][Px2][Px2]
[Px2][Px1][Px2]
[Px2][Px2][Px2]

Method 2 [DMA] write [Px1][Px2][Px3] & more as required & repeat (Example SiMD 64Bit = 4 x 16Bit)

[DMA][Px1][Px2][Px3]
[DMA][Px1][Px2][Px3]

Rupert S

*

Feature Properties Meta Data Tables & Tags DDC


LUT Colour Capacity Properties

Important Colour & LUT Caps for AMD
https://www.phoronix.com/news/AMD-Color-Steam-Deck

This reminds us to expose Caps both towards & from the OS & HDMI & DisplayPort,
Caps are exposed by the display in the form of LUT Table ICC such as BT2084, BT2020, BT709,

Obviously the GPU selects LUT Tables such as BT2084 from the HDMI port,
But what about exposure of colour caps from the GPU to the Display ?

The method of mutual lock for colour palette is a sure win,
Exposing additional capacities such as JIT Compiler, OpenCL, Vulkan & Direct Compute; Directly to the display!

But Why? Acceleration & Colour qualities; For example exposing the LUT Compiler from the GPU Directly to the display in DDM Immediate mode ALLM,

Colour & Cap exposure Would improve Colour rendering & additionally allow the displays to directly process LUT on the GPU,
Other features exposed through meta data could & would improve total rendering capacity & also utilise more of the DisplayPorts Capacity & bandwidth assignment.

RS

*

Upscaling & FMA
https://science.n-helix.com/2023/06/map.html

For when {(A+B/2)} = C Expressions https://is.gd/ForWhen_ABx2_C
For when {U, X, Y, Z} = N Expressions https://is.gd/ForWhen_UXYZ_N

*

Basic Upscaling Kernel Starter Set, Contains a basic set of what we hope to achieve.

Learning from proverb; Future Productions inc

OpenCL Kernel Builder
https://drive.google.com/file/d/1d_bWbZl9fAZXsLbN_jZdqSxdWzraLSIz/view?usp=share_link

Texture Encode Source
https://drive.google.com/file/d/1udWU4slmZkUGcagcJl1KwFWh5FJ5ScoN/view?usp=sharing

FSR Scaler
https://drive.google.com/file/d/1D27MOBYKVkKib1JzP_eFucp8RRrzAhd6/view?usp=share_link

Python ML Image denoisers, Very heavy denoising
https://github.com/cszn/BSRGAN
https://github.com/cszn/SCUNet

Crucial Codec source for projects
H266 https://drive.google.com/file/d/1Zt0CrP5p8ld7xnki1B9X4wz6Opyv13aH/view?usp=share_link
AV1 https://drive.google.com/file/d/179pqqS36v--t_BDjyhe1x_oVeYuxkWBw/view?usp=share_link
AAC https://drive.google.com/file/d/1YJy1yAdmEdjSMhtUjvTEU-y9HqJXFzzN/view?usp=share_link
LC3 https://drive.google.com/file/d/1_Gnf_PLN81YepCugmaRNofib7zLOHBNO/view?usp=share_link
DSC https://drive.google.com/file/d/1hbTFsFqzQTqLbhOaEwY-QkM4y3uAglXX/view?usp=share_link

X86Features-Emu
https://drive.google.com/file/d/15vXBPLaU9W4ul7lmHZsw1dwVPe3lo-jK/view?usp=usp=sharing

Upscale DL
https://is.gd/UpscaleWinDL

https://is.gd/HPC_HIP_CUDA

https://github.com/GPUOpen-LibrariesAndSDKs/RadeonML
https://github.com/GPUOpen-LibrariesAndSDKs/RadeonImageFilter

https://science.n-helix.com/2022/10/ml.html

*
https://github.com/ssube/diffusers/tree/feature/onnx-upscale

https://github.com/huggingface/diffusers
https://huggingface.co/ssube/stable-diffusion-x4-upscaler-onnx

https://huggingface.co/uwg/upscaler/tree/main
https://huggingface.co/nvmmonkey/optimal_upscale/tree/main
https://huggingface.co/gmp-dev/gmp-upscaler/tree/main/ESRGAN

Neural Engine
https://github.com/godly-devotion/MochiDiffusion

ML List & Services
https://huggingface.co/models?sort=downloads&search=upscale
https://huggingface.co/models
https://huggingface.co/pricing
*

Cubic SubSampling reference : 

https://science.n-helix.com/2023/03/path-trace.html
https://science.n-helix.com/2023/02/smart-compression.html

In simple principle SubS uses Probable interaction PDF & Ray Boxing (Isolated Cell Cube = [SS]/[SubS]),
We only therefore only need to Predict Sample for likely cube overflows into adjacent boxes.

Resampling first; As we are resampling a ray box for probable intersection with our primary target (viewer),
Our motive is that the viewer is the only one to see the rays; Only Science project need to know all; But not always,

We need a sample that does interact with the Observer/Viewer!
So we simply need a bounding box with a direction mesh (multiply by X) that shows probable cause to interact!

We know that Viewer X is the only person seeing that interaction & So we know that if we point a triangle towards a light source; We directly interact with a subsample array,
We do not need them all!

PDF Similarity is used with the Ray Box to allocate work to probable cause; Located at User interaction AKA Observer/Viewer.

https://gpuopen.com/download/publications/Efficient_Spatial_Resampling_Using_the_PDF_Similarity.pdf
https://gpuopen.com/download/publications/I3D2023_SubspaceCulling_updated.pdf

*

ReSTIR Additions


Super Sampling is a technique of loading a texture; Upscaling the texture into a 4x to 8x larger size Cache,
Lacroze & Gaussian Blends combined with sharpening (Also available in AA & Gaussian Sharpening & 3D Spline Interpolation),

Added to sharpening & upscaling is Bi & Tri Linear Interpolation..

Interpolation requires that you estimate points between pixels in the texture or image..

The implementation of Method Example 1 to 4 including Mipmapping [SS][SubS] Frame buffer With Multithreading Micro Framebuffer Groups..

Allows Super-Sampling with Micro-Block Frame Recursive & Forward temporal Predict.
The simple storage of a frame in advance enables the technique,
Once a frame is in the buffer the next frame is managed with:

Included Recursive & Forward frame interpolation.
Sharpening & Image Gaussian Blend, Sharpen & Sub-Sampling Anti-Alias

In the Micro Frame Buffer & Texture Context & Full Frame colour, WCG & HDR Quality optimisations.

Interpolation methods include:

Bit Average differential at higher DPI
Gaussian blending at a higher DPI & Sharpening

Both methods have an additional Method: ML Identify & Classify ResNet

ML Identify ResNet; Identifies the Shape intention & Classifies the object by content.
We can guess that a nose is angular down for example or that a Square will stay square..
MetaData containing the identity of objects helps a lot in classifying.

ML_iRN Resolution Upscale & Texture Scaling

Texture 256 | Texture buffer Size * N +

{
3D Spline Interpolation,
Gaussian,
AntiAlias,
Lacroze
}

Texture Buffer Final | Size * N

(c)RS

*

2 Main approaches to Pixel Blend Dither : RS

Strict Clarity; Very low blend count
Alpha Blending; Under 20% colour differentiated Rendering; In fonts as an example the most recommended is 30%.

Strictly speaking a blend with more than 20% colour from a predicted location of adjacent pixels is garish,
Far too blatant & Directly inaccurate..

Potentially 3% to 7% pixel blending is quite subtle on 1024x768 & lower down to 1%,
I have a great deal of experience optimising such displays as Combined signaling on wave generators & Radio..

The Super-Sample Technique; 1% to 7% colour & Luminance & Contrast differential AntiAlias & Super Sampling + mild sharpening (light settings 1 of 10)

So yes blends of low potential difference make quite a lot of difference to perceptions of quality; Combined with subtle sharpening & AA.

Pixel blending & Sharpness in context of Average Pixel density

On a 1024x768 Display Pixel blending from a range of 1 Meter just works as a method,
At the pixel density of a 1024x768 display discolouring an adjacent pixel with a complementary colour for a predicted sub-location...

Alpha blending in effect works for real on an HD 1200x768 resolution or more quite well!
On a display with a lower pixel count than 1024x768 ; A pixel is either Yes or No.

96px or greater & 720pHD or greater & Pixel blending works well,
The higher the resolution is & the larger the distance is to view & the better this works as a method.

Rupert S

*

Pixel format optimization : Pix-AL


RGBA & BGRA, We obviously load the texture in 3 colour layers & therefore create an optimal map for dithering & smoothing purposes..

Natively aligning all colours to their corresponding pixel bit, Blue, Green, Red..

Perfect!

Examples:

RGB Offset R0.0, G0.5, B1
RGB
RGB

BGR Offset B0.0, G0.5, R1
BGR
BGR

In such examples the textures are aligned | Align = 0.0, 0.5, 1.0
Pixels consist of Arrays of colour; We align the colour Mipmap & thus sharpen the Texture & Video,
VESA DSC Codec particularly.

Remember CRT, Plasma & LED TV's Had alignment firmware automation with analogue..

We automate the prospect of aligning Pixels by Colour with Texture formats such as:
DSC, DXT5, ETC, ATC, PVRTC, ASTC & DTX Compression for display frames.

RS

*

Colour like the angels SESW16 & SESW32 for pixel (c)RS


When it comes to demosaicing you are obviously aware of pixel grid from raws; As a photographer myself; I of course researched such topics..

But you do need a grid to demosaic; In LED displays you think you need to demosaic; But in reality you are purely mixing light though the spatial anomaly called Air blur,
But you really need to demosaic the content of the pixel colour; As the primes red, Green, Blue & shades of them are present in each patch..

The principle is that when you call a pixel write; You are improving visual quality prioritizing each colour components priority for processing or patch processed:

*
To explain patched, see the process of screen write; To either write by line or groups of lines &or Patch Cube,

In that you group segments of the Screen DMA refresh into either line writes or Squares in order to make the write process faster during a frame flip.
*

mosaicing reasons, Filtering pure colours for less distorted purity,
Example common configurations of pixel:
RGB
GBR

when we post an LED group such as this we can use two methods:

single energy spike with wavelet
Colour Encoded energy Spike with wavelet

In principle single energy spike allows a full range 16Bit FP SiMD to post full range channel data with no overheads; But we need 3 R G B

In principle 3 RGB code colour DAC Spike needs to be 48Bit to be 12Bit/16Bit Sensor...

So in effect as demonstrated here in my thesis; Single Wave 16Bit spike SiMD x 3 but filtered per colour = 16Bit x 3 or 48Bit

Data wavelength reduction reduces the method to one simple thing? How to do a patch with 16Bit Energy representation only ?

single energy spike Wavelet : SESW16 SESW32 and so on.

Colour like the angels SESW16 & SESW32 for pixel (c)RS

Rupert S

*

Pixel Order: RGBA & BGRA & RGBA_f16 BGRA_f16 : RS Applies to Video & Audio rendering & Delivery


Single F16 pulse per R,G,B, F16x3
F16 Wave32 10xRGB & 2 Control Filter F16
F32 Wave32 10xRGB but 2x F16 & 2 Control Filter F32

Depending on how much control a device has over how to draw/light a pixel...
The Method of prioritising the colour that is mainly processed; May have advantages.

In principle with my display as the example; If we light all 3 colours at the same time per pixel then...

BGRA is the apparent order; So a DMA Paint with {BGR}A is going to sharpen the colours in that order & also filter them for noise in that order!

So what effect does this have ?
All 3 colours can be drawn in a single pass; Although we could separate 3 Passes per frame if we want...

3 mono passes with 3 Layers of pure F16 R G B; In my case B G R,
Alternatively 3 Pure F16 Energy Rating pulses for the 3 colours per pixel.

What is the relevance of F16 x 3 ? Can't we do an F16 Palette with all 3 colours ?
Well no because my display is 10Bit so the output would be 30Bit!

30Bit is a lot more complex to produce from SiMD; Probably use 32Bit SiMD..
So not a problem; But all 3 colours would process in 32Bit & that is more work.

F16 Wavelet pure energy levels x 3 R G B means at most 3 cycles or 3 SiMD Units,
Bear in mind that SiMD is anywhere from Wave32 32 Operations, Wave64 64 Operations..

We could use F16 Wave32 and colour 10 Red, 10 Green, 10 Blue per cycle

Single F16 pulse per R,G,B, F16x3
F16 Wave32 10xRGB & 2 Control Filter F16
F32 Wave32 10xRGB but 2x F16 & 2 Control Filter F32

So in essence F16 makes more sense depending on what hardware we use,Filtering each colour in pure F16 Identically & with the same Shader!
More precise than F32 / 10Bit per R, G, B & 2 Control bits.

Rupert S

*

6 Way Matrix Spline Interlace Multiplier : RS


Matrix consist of a 6 way La{1:2:3} to Lb{1:2:3} Edge Detect & then interpolation with smoothing,

Edge detection promotes importance of aligned colour points,
This is called a 6 Way Matrix with Lma favouring La & Lmb favouring Lb; Lm is Lma 50/50 Lmb

Maths Basic
1920x1080 = 1918 lines | 6 way + 2 sides (up down) 3 Way & 1078 | 6 Way + 2 sides (Left Right) 3 Way

La{1:2:3}
Matrix Interpolation 1 to 3 lines
Lb{1:2:3}

               La 1 2 3
Matrix Lma 1 2 3
Matrix Lm 1 2 3
Matrix Lmb 1 2 3
               Lb 1 2 3

As you can see the matrix is 6 ways on real lines & multiplies or doubles lines,
Matrix can be 1+1/3, 1+2/3, 1+1, Etcetera.

This method is relatively simple & fast.

Rupert S

*

Identified material ML Shaped Edge Detect Gaussian sharpen & blend (c)Rupert S

When i say fast i mean, MOV {X,Y}stack | add {x + y} DIV {xy}/2 | MOV {stack} {FrameBuffer} :
I am afraid that this is about as FAST as Good upscaling under 2Ghz gets

Suitable for WebASM & WebGPU

Basic thought Upscaling ASM : RS

MOV {X,Y:X2,Y2:Xn,Yn}FrameBuffer1
var upscale = add {x + y} DIV {xy}/2 | MOV {XY + upscale}FrameBufferTemp
MOV {FrameBufferTemp+FrameBuffer1} LOC {FrameBuffer2(FrameBufferTemp+1 FrameBuffer1+0)}

Var table1 = input
Var table2 = interpolate
Var table3 = output

Var xy = 2/(X+Y)
For var table1 = {X1, Y1 : Xn, Yn} Then Var table2 = xy{X1, Y1 : Xn, Yn};
Then table3 = ({table1X + table2X+1} + {table1Y + table2Y+1})

*example
the tunisian & Ukraine low resolution cam footage has too low a frame rate for eye or hand motion
*

Interpolation and has lines, probably less than 25fps, the clear minded need to double the frame rate

So strangely enough, Double frame rate by copying predict frame & upscale the in-between frame before; Upscaling the previous frame & future frame with frame to frame interpolation & sideways & line to line inter-predicted interpolation..

Inter-predict interpolation sounds like a CPU heavy configuration; However the use of Gaussian (heavy or light precision) & spline interpolation, both temporal & resizing...

Because applied on top of or under the Identified material ML Shaped Edge Detect Gaussian sharpen & blend,

not too CPU heavy on 2Ghz+ AVX2 / SiMD / NANO

*

Example use of upscaling of non uniform size


One way to use this is if you want to change the Vertical/Horizontal plane so that it is more dense,

With the MicroLED and MicroLensing formula; you may require something more than a long LED Pixel..
So the 6 Way Matrix is ideal when you simply want to resize the image in one or two directions...

While the screen is still stated as 16:9 for formula; You might have square LED Pixels!
But you still would prefer not to use a lot of CPU for it; Mind you if you operate per line HD is still 1080 operations per frame.

But if you output 16 Lines per send & overlap the last 2 lines in the next write cycle:

1920x1080
77.14 14Line Writes + 2 lines of overlap for 16 Line write.
AKA 77 dimming zones vertical; With as many modifications as you want on line write.

This stops banding, printers show the effect of overlap printing but screens are the inverse; In printing we print slightly less in the banding area.

RS

*

MathML & scaling


https://www.w3.org/TR/MathML2/chapter2.html
https://developer.mozilla.org/en-US/docs/Web/MathML/Examples

Additional scaling example is the recently introduced MathML & scaling available in chrome source

In reference to scaling in displays & fonts we have two additional sources of internal resolution enhancement,
At least in terms of web browsers and Use Interfaces UI

With the two manageable systems we could potentially do quite a lot without increasing bandwidth costs..

Scaling down slightly higher resolution fonts & images & videos; To stunning details!

To be frank MathML appears not to be machine learning optimised; However in CSS markup we could use MathML..
To dynamically scale a webpage to DPI & Size & where preferred to a lower scaling & thus improved readability!

If we can take scaling as automatic input & read the results internally we could do quite a bit with it,
However MathML is quite good for things like price range conversion : £ Euro & $ & yen

we can use MathML quite flexibly; But is it a calculator ? It should be,
So we shall see!

RS

MathML is not only useful for displaying mathematical content, but also for performing calculations and conversions..

This means that mathematical content can be displayed at any size and resolution without pixelation or distortion.

Another example of scaling in displays and fonts is the use of internal resolution enhancement techniques, such as subpixel rendering and antialiasing,

These techniques improve the appearance and readability of text and images by smoothing out jagged edges and enhancing contrast,

For instance, MathML can be used to convert between different currency units, such as pounds, euros, dollars, and yen.

MathML can also handle complex calculations involving fractions, roots, powers, trigonometry, and more.

RS

*

TOPCloud Scaled Flexible WebASM & WebGPU & MathML!


Quite flexible for use on Monitors & TV's; Light processor load on simple tasks & offloadable such as TOPCloud!

You may be thinking Offloading is impracticable because that requires one of two things:

JIT Compiler Dongle..
USB device such as Firestick or GPU & CPU (With OpenCL Compat)

Server! so internet & service provision!
Impossible? No; WebAdvert supported TV's need both!
So why not HPC TOPCloud? could make a HOT TV a lot cooler & Eco friendly with Server repeating tasks:

Scaling
Quality Service
Service availability

TOPCloud Offload Logic:

In terms of WebASM & WebGPU & MathML; TOPCloud provides sufficient advantages to be considered a core utility..

While Offloading repeating content such as Siteload core stack (Server) & Localising configuration such as Webpage size & DPI & Dynamic font arrangements that require thought.

In terms of Offloaded function & Efficient system load for large configurations..

Especially efficient configurations such as TPU, Coral, GPU work & Cloud CPU that have large optimised stacks & installed drivers.

RS

*

3D Matrix Web Codecs


Are presented as being JIT Compiler re-encoded when required; Frequently WebASM, WebGPU Code, JS...
Audio, Video, Sensation, Code Runtimes.

Web Codecs for devices are a modern concept & are available for common websites such as news & music,
devices such as Alexa Echo & Google Dot & Bluetooth Devices?

Media players & BT devices particularly suffer from small Storage potential!
So Web Codecs downloaded to the device from a source; Such as a smart phone or computer..
Are a clear-minded solution!

JIT Compiler

3D Matrix Tables in FMA, Mul & ADD code to be automatically recompiled locally when required!
Directed to a common API, Direct Compute, WebGPU, WebASM, Jit Compiler OpenCL

Many Operations can be done from unique device specific optimisation; Examples:

API, DirectX & OpenCL & Vulkan & WebGPU & WebASM
Texture & Audio Shaders.
Digital Streaming

Bluetooth NANO SiMD & API
Digital TV in H266, VP9 & AV1,

Locally compiled accelerators should be respected first; Such as the output & input 3D Matrix & CPU & GPU Acceleration engine..

Code can include Matrix converters into common output format such as WebP & Textures & BC, DXT Compression presentation; Vulkan, OpenCL & DirectX & Texture & Audio Shaders.

Java, JS & WebASM are examples with operator mechanisms & JIT Compiler optimisation..
Minimising storage requirements for good compatibility while maximising performance.

RS

Requirements:

https://science.n-helix.com/2022/08/jit-dongle.html
https://science.n-helix.com/2022/06/jit-compiler.html

https://science.n-helix.com/2023/02/smart-compression.html
https://science.n-helix.com/2022/10/ml.html
https://science.n-helix.com/2023/06/map.html

*

Scaling; We can classify by colour or creativity. (c)RS


If you use TOPCloud, you can share between different displays in the TOP's Sense..
but mostly you would need cloud presence,

Mostly this would be about making the most out of TOP heavy Business GPU & personal ones in your computer or consoles.

But sharing common tasks such as scaling movies by type or by identifying a single movie to upscale...

Now you might be asking what we would be doing there?
Well a single movie uses the same materials in our ML; We can analyse the class & optimise the scaling by class..

For those familiar with games & FSR; We familiarise our code with a single game!
By doing this we improve our product and can therefore classify by:

Resolution
Style
Speed
Type, FPS for example & RTS

We can classify by colour or creativity...

We do not simply have to roll the dice on General Scaling, We can use classifiers:

Title
Scale
Type
Speed
Frame Rate
Colour & Composure

Rupert S

PoCL Source & Code
https://is.gd/LEDSource

*

Vector Instructions
https://science.n-helix.com/2023/06/map.html

https://science.n-helix.com/2022/08/simd.html
Vector Encoding : VECSR https://science.n-helix.com/2022/04/vecsr.html
https://science.n-helix.com/2019/06/vulkan-stack.html

*

Specification for Open Compute & Gaussian Interpolation & JIT Compile
Displacement Micromap : Interpolation & Extrapolation Policy : RS
https://science.n-helix.com/2023/02/smart-compression.html

Concept of JIT OpenCL
https://science.n-helix.com/2022/08/jit-dongle.html
https://science.n-helix.com/2022/06/jit-compiler.html

Demosaicking DoFP images using edge compensation method based on correlation
https://opg.optica.org/oe/fulltext.cfm?uri=oe-31-9-13536&id=529002
https://iopscience.iop.org/article/10.1088/1361-6501/accbdd/pdf

FPGA 'Xilinx Virtex-II' HPC application Multiple-Applications & Image-Net & Matrix-Multiplication - H-SIMD machine _ configurable parallel computing for data-intensive HPC
https://digitalcommons.njit.edu/cgi/viewcontent.cgi?article=1836&context=dissertations

A SIMD architecture for hard real-time systems
https://www.repository.cam.ac.uk/bitstream/handle/1810/315712/dissertation.pdf?sequence=2

Multiple Parallel SiMD Single Cycle - A Multi‐instruction Streams Extension Mechanism for SIMD Processor
https://ietresearch.onlinelibrary.wiley.com/doi/pdf/10.1049/cje.2017.09.013

Ideal for 4Bit Int4 XBox & Int8 GPU
PULP-NN: accelerating quantized neural networks on parallel ultra-low-power RISC-V processors - Bus-width 8-bit, 4-bit, 2-bit and 1-bit
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6939244/

Vulkan's,
Useful for Presentation with AA, work Rendering/Upscaling Shaders
https://drive.google.com/file/d/1KxxKRLOH01m5IYqAy9DeR9qq8gHIEdSs/view?usp=sharing

OpenCL, Hardline minimal code kernels,
(Code AA Processing yourself) Useful for work Rendering/Upscaling Shaders
https://drive.google.com/file/d/1SYLr0JwWD-DbbXHsrANxkFe2hBrn1cZf/view?usp=sharing

Shaders; Useful for texture cache & presentation
CL Shaders 2
https://drive.google.com/file/d/1c2K5GooOKY-kPHxiqc27A_l3pkcYxvZU/view?usp=sharing
V1.6 Shaders
https://drive.google.com/file/d/1C3Q9-LvB0T8p6XHpoZynttxuV2Eunwg2/view?usp=sharing,

Gaussian Interpolation, Useful for upscaling & AA
https://drive.google.com/file/d/1sjMpGVhvULsSloeoQ_zikzX2AzZlUBtY/view?usp=sharing

Texture Encode Source
https://drive.google.com/file/d/1udWU4slmZkUGcagcJl1KwFWh5FJ5ScoN/view?usp=sharing

*
Image Optimisation Training Datasets:(Download Folder to directory)

Upscaling Training Sample Set:
https://drive.google.com/drive/folders/16Z0izDX0JyajyLgWbH0E2W-RyKv_CckT?usp=sharing
Upscaling Training Sample Set, Eco Samples:
https://drive.google.com/drive/folders/1_gUJ4F9ibQWCMFX1IDSv708vA7-bmNCp?usp=sharing
Space Training Samples Set
https://drive.google.com/file/d/10lHycalqZFmsp_gwE5ym47GbEDv36pZJ/view?usp=sharing
*

*

https://is.gd/WaveletData


Python ML Image denoisers, Very heavy denoising
https://github.com/cszn/BSRGAN
https://github.com/cszn/SCUNet

To Compress using CPU/GPU: MS-OpenCL
https://is.gd/MS_OpenCL
https://is.gd/OpenCL4X64
https://is.gd/OpenCL4ARM

PoCL Source & Code
https://is.gd/LEDSource

Khronos-1.3Extens
*

The Smart-access


[Innate Compression, Decompression, QoS To Optimise the routing, Task Management To optimise the process] : Task Managed Transfer : DMA:PIO : Transparent Task Sharing Protocols

The following is the initiation of the Smart-access Age

https://science.n-helix.com/2023/02/smart-compression.html

Vector Encoding : VECSR https://science.n-helix.com/2022/04/vecsr.html

QoS To Optimise the routing:Task Management To optimise the process

https://science.n-helix.com/2021/11/monticarlo-workload-selector.html

https://science.n-helix.com/2023/02/pm-qos.html

https://science.n-helix.com/2021/10/he-aacsbc-overlapping-wave-domains.html

https://science.n-helix.com/2023/03/path-trace.html


FPGA 'Xilinx Virtex-II' HPC application Multiple-Applications & Image-Net & Matrix-Multiplication - H-SIMD machine _ configurable parallel computing for data-intensive HPC
https://digitalcommons.njit.edu/cgi/viewcontent.cgi?article=1836&context=dissertations

A SIMD architecture for hard real-time systems
https://www.repository.cam.ac.uk/bitstream/handle/1810/315712/dissertation.pdf?sequence=2

Ideal for 4Bit Int4 XBox & Int8 GPU
PULP-NN: accelerating quantized neural networks on parallel ultra-low-power RISC-V processors - Bus-width 8-bit, 4-bit, 2-bit and 1-bit
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6939244/

Transversal processing availability : Transparent Task Sharing Protocols


https://science.n-helix.com/2022/08/jit-dongle.html

https://science.n-helix.com/2022/06/jit-compiler.html

Machine Learning


https://science.n-helix.com/2022/10/ml.html

https://science.n-helix.com/2021/03/brain-bit-precision-int32-fp32-int16.html

Innate Compression, Decompression


https://science.n-helix.com/2022/03/ice-ssrtp.html

https://science.n-helix.com/2022/09/ovccans.html

https://science.n-helix.com/2022/09/audio-presentation-play.html

https://science.n-helix.com/2022/08/simd.html

Strobe Line by Line Run Length Compression DVB, NTSC, VESA :RS Approved
https://drive.google.com/file/d/148-BpVSfT6bA5nPjKoiZ41vwuI9n7P_f/view?usp=sharing

Examples of compression
https://godotengine.org/article/betsy-gpu-texture-compressor/
https://github.com/darksylinc/betsy/blob/master/Docs/technical_doc_advanced.md

*
Gain the diplomacy of a Scaler Scailing Cause : News on the Gaining
Diplomacy World Wide
https://drive.google.com/file/d/1OfG8X_PuqAyICbI-wrLar2trSiz5kFix/view?usp=drive_web

Gain the diplomacy of a Scaler Scailing Cause : News on the Gaining
Diplomacy World Wide 2
https://drive.google.com/file/d/1T5Qx_k9EIousRox0H7sixkgEWmBINQIB/view?usp=drive_web
*

Sound Open Firmware : Supported by Intel, AMD, Realtek, MediaTek, DTS, Dolby, RS & so-on
S.O.F provides an open source audio DSP firmware and SDK for audio or signal processing on modern DSPs

https://thesofproject.github.io/latest/algos/index.html
https://www.sofproject.org/
https://github.com/thesofproject

*****

Good stuff for all networks nation wide, the software is certificate signed & verified
When it comes to pure security, We are grateful https://is.gd/SecurityHSM https://is.gd/WebPKI
TLS Optimised https://drive.google.com/file/d/10XL19eGjxdCGj0tK8MULKlgWhHa9_5v9/view?usp=share_link
Ethernet Security https://drive.google.com/file/d/18LNDcRSbqN7ubEzaO0pCsWaJHX68xCxf/view?usp=share_link

These are the addresses directly of some good ones; DNS & NTP & PTP 2600:c05:3010:50:47::1 2607:fca8:b000:1::3 2607:fca8:b000:1::4 2a06:98c1:54::c12b 142.202.190.19 172.64.36.1 172.64.36.2 38.17.55.196 38.17.55.111

*****

Andro-linux libs : x86 & ARM : Learn
https://drive.google.com/drive/folders/1BRQOIK1eAUEMnTTGjsQ0h0g6jGLzWqZI

Python Deep Learning:

AndroLinuxML : https://drive.google.com/file/d/1dVJHPx9kdXxCg5272fPvnpgY8UtIq57p/view?usp=sharing
Linux : https://drive.google.com/file/d/1u64mj6vqWwq3hLfgt0rHis1Bvdx_o3vL/view?usp=sharing
Windows : https://drive.google.com/file/d/1dVJHPx9kdXxCg5272fPvnpgY8UtIq57p/view?usp=sharing

good stuff for all networks nation wide, the software is certificate signed & verified
When it comes to pure security, We are grateful https://is.gd/SecurityHSM https://is.gd/WebPKI
TLS Optimised https://drive.google.com/file/d/10XL19eGjxdCGj0tK8MULKlgWhHa9_5v9/view?usp=share_link
Ethernet Security https://drive.google.com/file/d/18LNDcRSbqN7ubEzaO0pCsWaJHX68xCxf/view?usp=share_link

Thursday, February 23, 2023

PM-QoS - Processor Model QoS Tree for TCP, UDP & QUICC

Quality of Service Protocol & the TCP & UDP & QUICC Protocols : RS


Extremely good for HDMI & DisplayPort & USB/URT & 2.4G/Bluetooth : In regards to Codec development and flow & device control,
Audio, Video, Process & Command

https://www.ietf.org/archive/id/draft-scheffenegger-congress-rfc5033bis-00.txt

Congress - Congestion Control - Combined Network QOS Routing Table Tree-Swarm - Quality of Service Protocol & the TCP & UDP & QUICC Protocols

*

Processor Model for TCP, UDP & QUICC : (c)RS


To put TCP, UDP & QUICC in a proper place in your minds for application,
Think about Applying them to processors; Particularly Neuromorphic, ML & GPU/CPU!

How exactly?

Address space modelling for data transfer:
Between RAM, HDD/SDD & CPU & Internally mapping across cache & Sparse Model NAND Gates.

In the situation internal to Device Gates & Logic Circuits; We map address spaces across the processor,
We internalize the location logic as a network & utilise TCP, UDP & QUICC,

We do not need the sending strategy of Data Transfer to be Random; Random wastes Bandwidth!
But we do need a QOS Data Transfer policy & Networking Tactics!

Why ? Not all processor functions are directly connected in MultiChip & 3D Model Processor.

*

By thinking about the Processor Model for TCP, UDP & QUICC : (c)RS

We soon find the best light TCP, UDP & QUICC Network Strategy.

Think about this model designing the Network Protocols

RS

*

"Kevin Cisco-Kevin

Date: Tue, 21 Feb 2023 08:32:03 -0800

Subject: Re: To think about the Network Model : Processor Model for TCP, UDP & QUICC : (c)RS

What we really need is a transfer layer mechanism modeled after Swarm

where packets are broken up into chunks and reassembled after

handshaking. But we don't live in that world."

Kevin Suggests we think about Swarm : RS : What do i think on average (Swarm)

PM-QoS - Swarm : Networking TCP UDP QUICC NTP DNS


I think that Swarm; Multi Target Networking is a primary method under consideration for QUICC & UDP & NTP Responses,

Swarm is high noise; High Volume Send & Receive,
With alteration though Statistical & Machine rout optimisation... That bandwidth cost reduces,
ML : Neural network, Send, Receive & Confirm, Swarm, In effect on globally predictable commodities such as:

NTP, DNS (popular), News & Decentralised command...

Can work! Network Command requires directly applied logic; What i mean is : Confirmed Command & Reception affirmation & Action!

So i propose the following:

Combined Network QOS Routing Table Tree-Swarm : CNetQSRT-Tree-Sw : Rupert S 2023-02

QOS Applied to QUIC, TCP, UDP Data packet Anagrams

What I mean is that QUIC is a protocol that passes data through multiple network adapters like a tree,
What we do is send information on the data transfer abilities of each adapter (locally) & prefer a route,
We prioritise routes based on data flow statistics & choose thereby optimum routes...

By Statistically collating data locally (in network adapter group, per localised network...

We will further select a route based on those statistics; Machine Learning is not obligatory & hence there is less to go wrong,

Routers do not need to be as modern & We can collect that information for routing tables & Create Optimum routes; Like a tree; With little need for control or modification...

All TCP, UDP & QUIC & NTP & DNS packets get to the required destination fast & with low latency.

QOS is clearly of advantage to QUIC, Because we can assess the data throughput of the modems/Network adapters & change routes? 
For optimum performance & minimum error or work.

Swarm:ML (Known Receiver : Known Sender)

QOS
NTP
DNS Global Submit

Network Tunnelling, For example: Torado, Large Download Acceleration

Secure Network Tunnelling, For example: VPN, VPS, Ethernet, 3G, 4G LTE, Volt, 5G Volt, Telecommunications Networking & GPS)

Defined routing with QOS Network optimisation (Localised) & Data bandwidth data (Localised)

Global Zone routing through tables...

Statistic Enhanced Routing & Delivery

*

QOS : Quality Of Service protocol : RS https://is.gd/LEDSource


Personally I believe QOS : Quality Of Service protocol be introduced
to all network traffic,
Primarily the Point A to point Z route needs planning first.

QOS Datagram
Network throughput Capacity of the network card
Advertise Capacity in local network
Plan routes based on network capacity

So the Quality Of Service Protocol needs to send a datagram to the
network adapter of site:

A to Z

A list of local routes needs to be cached & prioritised based on
Network point A's network capacity & priority,

The rout needs Point A to Z mapped & Z to A

We then send data with a packet listing preferred routes

[QOS][Origin : Target][Preferred route list forward sent][Network Performance Metric Packet]

[Origin : Target][Preferred route list forward sent][Semi Static Route Tunnel]

[Packet ID][Origin : Target][Data Packet]

Searching for a route with every packet costs processor Cycles; So
preferred routes need to be tunnelled & Secured with TLS

Rupert S

https://is.gd/CryptographicProves

Vectors & maths
https://science.n-helix.com/2022/08/simd.html
https://science.n-helix.com/2022/04/vecsr.html
https://science.n-helix.com/2016/04/3d-desktop-virtualization.html
https://science.n-helix.com/2022/04/vecsr.html
https://science.n-helix.com/2018/01/integer-floats-with-remainder-theory.html

Networking & Management
https://science.n-helix.com/2023/06/tops.html
https://science.n-helix.com/2023/06/ptp.html
https://science.n-helix.com/2023/06/map.html
https://science.n-helix.com/2022/08/jit-dongle.html
https://science.n-helix.com/2022/06/jit-compiler.html
https://science.n-helix.com/2022/03/ice-ssrtp.html
https://science.n-helix.com/2022/01/ntp.html

Faster Maths & ML
https://science.n-helix.com/2018/01/integer-floats-with-remainder-theory.html
https://science.n-helix.com/2021/02/multi-operation-maths.html
https://science.n-helix.com/2021/11/parallel-execution.html
https://science.n-helix.com/2022/12/math-error-solve.html
https://science.n-helix.com/2021/03/brain-bit-precision-int32-fp32-int16.html
https://science.n-helix.com/2022/10/ml.html

Focus on Quality
https://science.n-helix.com/2022/09/ovccans.html
https://science.n-helix.com/2022/11/frame-expand-gen-3.html
https://science.n-helix.com/2022/03/fsr-focal-length.html

Code Speed

https://science.n-helix.com/2022/08/simd.html

https://science.n-helix.com/2022/09/ovccans.html


Chaos

https://science.n-helix.com/2022/02/interrupt-entropy.html

https://science.n-helix.com/2022/02/rdseed.html

https://science.n-helix.com/2020/06/cryptoseed.html

Example of a Secure Tunnel System:

https://is.gd/SecurityHSM https://is.gd/WebPKI

TLS Optimised
https://is.gd/SSL_Optimise

Ethernet Security
https://is.gd/EthernetTunnelOpt

*****

Suitable for codec, Texture, Video Element, Firmware & ROM, Executable, Storage & RAM, DLL & Library runtimes, CSS & JS & HDMI & DisplayPort VESA Specifications :


https://science.n-helix.com/2023/02/pm-qos.html
https://science.n-helix.com/2022/09/ovccans.html

Install and maintain as provided HPC Pack, Exactly as presented with nodes & functions; Be as generous as you can towards our research goals.

https://science.n-helix.com/2018/09/hpc-pack-install-guide.html

RS

*****

PM-QoS - Processor Model QoS Tree for TCP, UDP & QUICC


The Method of PM-QoS Roleplayed in a way that Firmware & CPU Prefetch ML Coders can understand.

Environment:
https://science.n-helix.com/2021/11/monticarlo-workload-selector.html
https://science.n-helix.com/2023/02/pm-qos.html
https://science.n-helix.com/2022/03/security-aspect-leaf-hash-identifiers.html

Multiple Busses &or Processor Features in an Open Compute environment with competitive task scheduling

[Task Scheduler] Monticarlo-Workload-Selector

We prioritise data traffic by importance & Need to ensure that all CPU Functions are used...

In the case of a Chiplet GPU We need to assign function groups to CU & QoS is used to asses available Multiple BUSS Capacities over competing merits,
[Merits : Buss Data Capacity, Buss Cycles, Available Features, Function Endpoint]

PM-QoS is a way of Prioritising Buss traffic to processor functions & RAM & Storage Busses that:

States a data array such as:

Buss Width

divisibility ((Example) Where you transform a 128Bit buss into 32Bit x 4 Data motions and synchronize the transfers,

Data Transfer Cycles Available

Used Data Rate / Total Data Throughput Rate = N

(c)Rupert S https://science.n-helix.com


**************************** Reference Ambition

Title: Specifying New Congestion Control Algorithms

Date: Fri, 17 Feb 2023 16:39:25 +0100

https://rscheff.github.io/rfc5033bis

https://github.com/rscheff/rfc5033bis/issues




Title: Specifying New Congestion Control Algorithms

Document date: 2023-02-17

https://www.ietf.org/archive/id/draft-scheffenegger-congress-rfc5033bis-00.txt

Status:

https://datatracker.ietf.org/doc/draft-scheffenegger-congress-rfc5033bis/

Abstract:

The IETF's standard congestion control schemes have been widely shown

to be inadequate for various environments (e.g., high-speed

networks). Recent research has yielded many alternate congestion

control schemes that significantly differ from the IETF's congestion

control principles. Using these new congestion control schemes in

the global Internet has possible ramifications to both the traffic

using the new congestion control and to traffic using the currently

standardized congestion control. Therefore, the IETF must proceed

with caution when dealing with alternate congestion control

proposals. The goal of this document is to provide guidance for

considering alternate congestion control algorithms within the IETF.

The IETF Secretariat