Fast Pixel Access - GameCreators Forum

Author

Message

tboy

12

Years of Service

User Offline

Joined: 1st Jan 2013

Location:

Posted: 17th Apr 2017 09:11 Edited at: 17th Apr 2017 09:12

Link

Hi peeps,

I'm just getting back into AppGameKit after quite sometime and I'm working on a scratch card effect,
I can access pixel data using memblocks but I find the method I have come up with is quite
slow, if the mods read this or if anyone knows, is TGC planning on direct pixel access, especially
for HTML5?

I've managed to do this using JavaScript which is quite fast but I can't seem to get a decent one
working with AGK.

The way I've come up with is draw a red sprite to a temp image and all red pixels are replaced
with the pixel location of the image I want to reveal, it's possible with AppGameKit I see but not that fast,
anyone come up with a better method?

Thanks in advance

main.agc

// Project: testrenderimage 
// Created: 2017-04-16

#include "scratch.agc"

// show all errors
SetErrorMode(2)

// set window properties
SetWindowTitle( "testrenderimage" )
SetWindowSize( 640, 480, 0 )

// set display properties
SetVirtualResolution( 640, 480 ) // doesn't have to match the window
SetOrientationAllowed( 1, 1, 1, 1 ) // allow both portrait and landscape on mobile devices
SetSyncRate( 60, 0 ) // 30fps instead of 60 to save battery
SetScissor( 0,0,0,0 ) // use the maximum available screen space, no black borders
UseNewDefaultFonts( 0 ) // since version 2.0.22 we can use nicer default fonts

CreateRenderImage (1, 640, 480, 0, 0) 
imgTemp = CreateSprite(1)

imgPointer = CreateSprite(LoadImage("circle.png"))

imgPointerWidth = GetSpriteWidth(imgPointer)
imgPointerHeight = GetSpriteHeight(imgPointer)

imgPointerMidpointX = (imgPointerWidth/2)
imgPointerMidpointY = (imgPointerHeight/2)

Function DrawPointerSprite(spritePointer, Xpos, Ypos)
  SetSpriteTransparency(spritePointer, 2) 
  SetSpritePosition(spritePointer, Xpos, Ypos)
  DrawSprite(spritePointer)
EndFunction

do
  Print("Total Pixels: " + Str(pixelData.Length))
  Print("Total Red Pixels: " + Str(redCount2))
	
  If (GetPointerState() = 1)
    SetRenderToImage(1, 0)
    DrawPointerSprite(imgPointer, GetPointerX()-imgPointerMidpointX, GetPointerY()-imgPointerMidpointY)
	  
    imgPixelCounter = GetImageRGBA(1)
	   
     for i = 0 to pixelData.Length
        If (pixelData[i].red)
         redCount = redCount + 1
       EndIf
     next
	  
     redCount2 = redCount
     redCount = 0
   EndIf
   
    SetRenderToScreen()
    DrawSprite(imgTemp)
	
    DrawPointerSprite(imgPointer, GetPointerX()-imgPointerMidpointX, GetPointerY()-imgPointerMidpointY)

If (GetRawKeyPressed(27))
     End
  EndIf
	
    Sync()
loop

+ Code Snippet

// Project: testrenderimage 
// Created: 2017-04-16

#include "scratch.agc"

// show all errors
SetErrorMode(2)

// set window properties
SetWindowTitle( "testrenderimage" )
SetWindowSize( 640, 480, 0 )

// set display properties
SetVirtualResolution( 640, 480 ) // doesn't have to match the window
SetOrientationAllowed( 1, 1, 1, 1 ) // allow both portrait and landscape on mobile devices
SetSyncRate( 60, 0 ) // 30fps instead of 60 to save battery
SetScissor( 0,0,0,0 ) // use the maximum available screen space, no black borders
UseNewDefaultFonts( 0 ) // since version 2.0.22 we can use nicer default fonts

CreateRenderImage (1, 640, 480, 0, 0) 
imgTemp = CreateSprite(1)

imgPointer = CreateSprite(LoadImage("circle.png"))

imgPointerWidth = GetSpriteWidth(imgPointer)
imgPointerHeight = GetSpriteHeight(imgPointer)

imgPointerMidpointX = (imgPointerWidth/2)
imgPointerMidpointY = (imgPointerHeight/2)

Function DrawPointerSprite(spritePointer, Xpos, Ypos)
  SetSpriteTransparency(spritePointer, 2) 
  SetSpritePosition(spritePointer, Xpos, Ypos)
  DrawSprite(spritePointer)
EndFunction

do
  Print("Total Pixels: " + Str(pixelData.Length))
  Print("Total Red Pixels: " + Str(redCount2))
	
  If (GetPointerState() = 1)
    SetRenderToImage(1, 0)
    DrawPointerSprite(imgPointer, GetPointerX()-imgPointerMidpointX, GetPointerY()-imgPointerMidpointY)
	  
    imgPixelCounter = GetImageRGBA(1)
	   
     for i = 0 to pixelData.Length
        If (pixelData[i].red)
         redCount = redCount + 1
       EndIf
     next
	  
     redCount2 = redCount
     redCount = 0
   EndIf
   
    SetRenderToScreen()
    DrawSprite(imgTemp)
	
    DrawPointerSprite(imgPointer, GetPointerX()-imgPointerMidpointX, GetPointerY()-imgPointerMidpointY)

   If (GetRawKeyPressed(27))
     End
  EndIf
	
    Sync()
loop

scratch.agc

+ Code Snippet

Type imageData
  red As Integer
  green As Integer
  blue As Integer
  alpha As Integer
EndType

Global pixelArray As Integer[0]
Global pixelData As imageData[0]

Function GetImageRGBA(imageID As Integer)
  startByte = 12
   
  imgMem = CreateMemblockFromImage(imageID)
  
  imgDataSize = GetMemblockSize(imgMem)
  
  pixelArray.length = imgDataSize
  pixelData.length = ((imgDataSize)/4)-4
  
  For n = startByte To imgDataSize-1
    pixelArray[n-startByte] = GetMemblockByte(imgMem, n)
  Next
  
  For i = 0 To pixelData.Length
    pixelData[i].red = pixelArray[(i*4)]
    pixelData[i].green = pixelArray[(i*4)+1]
    pixelData[i].blue = pixelArray[(i*4)+2]
    pixelData[i].alpha = pixelArray[(i*4)+3]
  Next
  
  DeleteMemblock(imgMem)
EndFunction imageID

Back to top

Profile PM

BatVink

Moderator

22

Years of Service

User Offline

Joined: 4th Apr 2003

Location: Gods own County, UK

Posted: 17th Apr 2017 09:54 Edited at: 17th Apr 2017 09:56

Link

This has been discussed recently (can't remember which thread). I think that what you are doing is probably as efficient as you can get at the moment, there is no direct backbuffer access. somebody looked into the technicalities and decided it would be quite hard for TGC to implement.

An alternative might be image masks. You could draw your transparency to a second image (e.g as white on black) and apply this as a transparency mask to your actual image. You could maybe have some sprite "stamps" that you draw to the mask to cover a larger area. Check out this thread regarding masks.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Quidquid latine dictum sit, altum sonatur
TutCity is being rebuilt

Back to top

Profile PM Email

Phaelax

DBPro Master

22

Years of Service

User Offline

Joined: 16th Apr 2003

Location: Metropia

Posted: 17th Apr 2017 15:03

Link

Quote: " somebody looked into the technicalities and decided it would be quite hard for TGC to implement."

Is it because of the multiple devices supported? They did it with DBP.

"I like offending people, because I think people who get offended should be offended." - Linus Torvalds

Back to top

Profile PM Email Website

BatVink

Moderator

22

Years of Service

User Offline

Joined: 4th Apr 2003

Location: Gods own County, UK

Posted: 17th Apr 2017 15:13

Link

It worked really well with DBP, I wrote a tutorial and we had a related competition - https://www.thegamecreators.com/pages/newsletters/newsletter_issue_33.html#12

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Quidquid latine dictum sit, altum sonatur
TutCity is being rebuilt

Back to top

Profile PM Email

Kevin Picone

22

Years of Service

User Offline

Joined: 27th Aug 2002

Location: Australia

Posted: 17th Apr 2017 15:18 Edited at: 17th Apr 2017 15:36

Link

pulling the image data from device can be very expensive (the GPU and CPU aren't on the same BUS), so it's not an ideal structure for doing something real time.

anyway the cost per pixel in the GetImageRGBA function seems pretty high with two loops with some redundant calcs inside the inner loops..
so you should be able to restructure and improve the through put of that part of the code, but the bottle neck may well be pulling it fom video memory to begin with or freeing the buffer each time.

So taking a look at this part,

+ Code Snippet

    
  For n = startByte To imgDataSize-1
    pixelArray[n-startByte] = GetMemblockByte(imgMem, n)
  Next
   
  For i = 0 To pixelData.Length
    pixelData[i].red = pixelArray[(i*4)]
    pixelData[i].green = pixelArray[(i*4)+1]
    pixelData[i].blue = pixelArray[(i*4)+2]
    pixelData[i].alpha = pixelArray[(i*4)+3]
  Next

The second loop contains pixel offset calc, so it'd be easier on the runtime to only compute this once.

+ Code Snippet

    
  For n = startByte To imgDataSize-1
      pixelArray[n-startByte] = GetMemblockByte(imgMem, n)
  Next

  For i = 0 To pixelData.Length
     ; compute offset once,  removes 3 calcs per pixel   
     Offset= i*4
      pixelData[i].red = pixelArray[Offset]
     pixelData[i].green = pixelArray[Offset+1]
     pixelData[i].blue = pixelArray[Offset+2]
     pixelData[i].alpha = pixelArray[Offset+3]
  Next

Depending upon the runtime FOR/NEXT loops can either compute the END expression every loop or they'll pre compute the END loop value once and protect it within a local.. IF it computes the expression every loop, then some free speed can be had just be computing the end values up front.

+ Code Snippet

    

  imgDataSize_MinusOne =imgDataSize-1
  For n = startByte To imgDataSize_MinusOne
          pixelArray[n-startByte] = GetMemblockByte(imgMem, n)
  Next

  ; make sure we only compute size once
   PixelDataSize =pixelData.Length

  For i = 0 To pixelDataSize
     ; compute offset once,  removes 3 calcs per pixel   
     Offset= i*4
      pixelData[i].red = pixelArray[Offset]
     pixelData[i].green = pixelArray[Offset+1]
     pixelData[i].blue = pixelArray[Offset+2]
     pixelData[i].alpha = pixelArray[Offset+3]
  Next

The problem with this is there's twp loops going over the data byte by byte.. so a 256*256 image gives 256*256*4*2 loops.. which is lot of empty overhead for a runtime to soak up..

You can rid of half of the looping just by merging them..

+ Code Snippet

    

  imgDataSize_MinusOne =imgDataSize-1

  ; make sure we only compute size once
   PixelDataSize =pixelData.Length

  ; Prolly shoukd check if the target buffer is big enough for data. but we won't here

  For n = startByte To imgDataSize_MinusOne step 4

     ; compute dest offset
      i = (N- StartByte) / 4

      ; unroll the read to grabdthe 4 pixels 
     pixelData[i].red      = GetMemblockByte(imgMem, n)
     pixelData[i].green = GetMemblockByte(imgMem, n+1)
     pixelData[i].blue    = GetMemblockByte(imgMem, n+2)
     pixelData[i].alpha  = GetMemblockByte(imgMem, n+3)
 
 Next

if you assume 1 to 1 ratio of runtime opcodes to user code (which is very unlikely) but easy to visualize.. The cost per pixel is about say 15 operations compared to 28/29 in the original loop.
Even so it's still not going to sing if you is throw a big image at it... Although if know the section that has changed, then you can selectively grad that section from the memblock->array. So only when the entire image changed would you need to brute force the buffer.

You could just read the pixels as Integers and split up the RGB's, but that's something for you to do.. although the split cost might be heavier than a memblock peek... but that's where the fun is in this stuff

NOTE: All this is untested.

PlayBASIC To HTML5/WEB - Convert PlayBASIC To Machine Code

Back to top

Profile PM Website

tboy

12

Years of Service

User Offline

Joined: 1st Jan 2013

Location:

Posted: 17th Apr 2017 15:28

Link

Some good optimization suggestions and ideas, I'll give them a try and see what difference it could make.

Many thanks!

Back to top

Profile PM

CodeName

8

Years of Service

User Offline

Joined: 30th Dec 2016

Location:

Posted: 17th Apr 2017 23:49 Edited at: 18th Apr 2017 04:18

Link

http://www.roxlu.com/2014/048/fast-pixel-transfers-with-pixel-buffer-objects

Maybe Mr. Johnston wouldn't mind this implement?

AGK2 dose not use lock_pixels() but this new method runs the GPU then you take information! ( asynchronously ) : Edit

Losing out on just a few FPS.

Would be cool to be able to crunch data or read ABGR in AGK2.

A Get_Image would then become much much faster!

Take care.

Back to top

Profile PM

Sorry your browser is not supported!

Newcomers AppGameKit Corner / Fast Pixel Access