Any RIO/FPGA users out there?

mattnrel · September 24, 2004

Are there any NI-RIO/FPGA users out there interested in discussing issues regarding this product?

=]

matt hill

lavezza · September 26, 2004

My cRIO system just shipped. I haven't done any FPGA programming yet, so I'm sure I'll have a lot to talk about in the coming weeks.

Richard_Jennings · September 28, 2004

Hi Matt,

I'll repost our earlier email Q&A:

On Monday, September 27, 2004, at 10:05 AM, Hill, Matt wrote:

I'm acquiring PCM-encoded streams, decoding them, and sending the results to an RT target in real time. (The decoded data must be sent every 2.5 ms.) I'm trying to implement this routine as many times as I can on a single board, and I'm running out of space. I'm trying to determine which coding techniques are the most efficient. There doesn't seem to be any way to isolate a bit of code and benchmark how much of the FPGA it occupies, so it's a difficult process.

>>Are you using single-cycle loops? They save a lot of space on the FPGA. Also watch out for the extra logic added by DIO arbitration.

Also, I'm trying to achieve high speed in my processing. In this area I can isolate code and test it, but benchmarking every available coding option is a bit tedious. It would be nice to have some general guidelines.

>>I use single-cycle loops and pipelining wherever possible, avoid deeply nested logic, and toggle a digital line as a flag so I can verify and benchmark execution on a scope.
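For anyone new to pipelining, here is a rough software analogy in Python (LabVIEW FPGA is graphical, so this is only a model, not FPGA code). The two stages, `stage_a` and `stage_b`, are made-up placeholders. The idea is that a register between the stages lets each loop iteration do only one stage's worth of work per sample, at the cost of one iteration of latency.

```python
def stage_a(x):
    # Hypothetical first stage: scale the sample.
    return x * 3

def stage_b(y):
    # Hypothetical second stage: add an offset.
    return y + 1

def unpipelined(samples):
    # Both stages run back to back in every iteration (one long logic path).
    return [stage_b(stage_a(x)) for x in samples]

def pipelined(samples):
    # 'reg' plays the role of a shift register between the two stages.
    reg = None
    out = []
    for x in list(samples) + [None]:    # one extra iteration flushes the pipeline
        if reg is not None:
            out.append(stage_b(reg))    # stage B uses the previous iteration's value
        reg = stage_a(x) if x is not None else None  # stage A uses the new sample
    return out

samples = [1, 2, 3, 4]
print(unpipelined(samples))
print(pipelined(samples))
```

Both versions produce the same results; the pipelined one simply delivers each result one iteration later.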

Here are some examples of the questions I have:

1. It seems that the fastest way to send data from the FPGA to the RT target is to use a single IRQ on the FPGA, and then to post data word by word in a synchronous indicator. A synchronous indicator only updates once it has been read by the RT target. This technique is undocumented by NI, but one of their FPGA people showed it to me. I have found this to yield the highest transfer rates. It would be nice to understand why this is faster than sending the data in an array.

>>Under LV 7.0 the max data rate I could achieve was 4.2 Mbits/sec. I stored data into on-chip memory in one loop and read it out in another. The read-out loop would place the data into an array and flag an interrupt. I haven't tried this with the synchronous indicator - just found out about it Thursday. Interrupt-driven transfers worked, but even on our monster machine, the CPU was bogged down handling interrupts. Forget about placing a graph on the front panel to display the data. My solution was to complain loudly to NI and route the byte-aligned data in parallel out the external connector to a 6534 DIO card. It's a horrible kludge, but it gets the data into the PC.
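To put that number next to the 2.5 ms update period mentioned earlier, here is a back-of-the-envelope budget in Python. It assumes the 4.2 Mbits/sec figure is a sustained rate and that data moves as 32-bit words; both are assumptions for illustration, and a different setup may behave differently.

```python
RATE_BITS_PER_S = 4_200_000   # rate quoted above for LV 7.0
WINDOW_US = 2_500             # 2.5 ms update period, in microseconds
WORD_BITS = 32                # assumption: U32 transfers

def bits_per_window(rate_bits_per_s, window_us):
    # Integer arithmetic avoids floating-point rounding surprises.
    return rate_bits_per_s * window_us // 1_000_000

def words_per_window(rate_bits_per_s, window_us, word_bits):
    # Whole words that fit in one window at the given rate.
    return bits_per_window(rate_bits_per_s, window_us) // word_bits

print(bits_per_window(RATE_BITS_PER_S, WINDOW_US))              # 10500 bits
print(words_per_window(RATE_BITS_PER_S, WINDOW_US, WORD_BITS))  # 328 words
```

So at that rate, each 2.5 ms window has room for roughly 10,500 bits, or 328 whole U32 words, shared across every decoder instance on the board.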

2. I have problems where my FIFOs become unlinked from the program and return random data, but there is no error. I believe this unlinking is a bug that occurs when you rename or copy an FPGA VI. Has anyone else experienced this?

>> I haven't.

3. When storing data temporarily, I use FIFOs. These can be created as flip-flops, look-up tables, or block memory. I would like to know the trade-off between speed and FPGA space when comparing flip-flops to block memory.

>> Don't know

4. What is the penalty incurred for creating a subVI within FPGA code, both in terms of time and space?

>>I was told none. Everything is treated as one monolithic program by the compiler.

5. How does saving data in a shift register compare with saving it in a FIFO?

>>Shift registers are mapped directly into logic gates. Not sure what overhead is added by FIFOs.

6. If I have a Boolean constant linked to 5 data sinks, should I separate this into five constants, on the theory that it will require less signal routing in the FPGA, or link all sinks to the same constant?

>>Don't know, but I think this would be something the compiler would optimize.

7. If I send a U8 source into a U32 sink, I get the grey dot indicating a data type mismatch and LabVIEW changes the data to the correct type for me. Is it more efficient to let LabVIEW perform the conversion, or should I insert a conversion node myself?

>>Don't know, but I would guess it's the same

8. To what lengths should I go to avoid using case structures?

>>I use them wherever I need them. I avoid deeply nested case structures, and try to minimize the number of cases. In the next issue of LabVIEW Technical Resources is a short article on LabVIEW FPGA that you might find interesting.

The FPGA code has its own rules and there seems to be very little documentation available to explain the optimal way of approaching common tasks. Any little tips and tricks that anyone has discovered would be much appreciated.

>>What are yours? Richard
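Going back to question 1: the behavior described there (an indicator that "only updates once it has been read") can be modeled roughly in Python as below. This is only a toy model of that handshake; how NI actually implements it is not documented in this thread, and the names `SyncIndicator`, `try_write`, `try_read`, and `transfer` are invented for the sketch.

```python
class SyncIndicator:
    """Toy model: accepts a new value only after the old one was read."""

    def __init__(self):
        self._value = None
        self._pending = False   # True while a written value is still unread

    def try_write(self, value):
        # FPGA side: refuse to overwrite an unread value.
        if self._pending:
            return False
        self._value = value
        self._pending = True
        return True

    def try_read(self):
        # Host side: returns (ok, value); a successful read releases the writer.
        if not self._pending:
            return False, None
        self._pending = False
        return True, self._value


def transfer(words, host_reads_every=2):
    # Tick-by-tick simulation: the writer tries on every tick,
    # the host reads only on every Nth tick (N is an arbitrary choice here).
    ind = SyncIndicator()
    received, i, tick = [], 0, 0
    while len(received) < len(words):
        tick += 1
        if i < len(words) and ind.try_write(words[i]):
            i += 1
        if tick % host_reads_every == 0:
            ok, value = ind.try_read()
            if ok:
                received.append(value)
    return received


print(transfer([10, 20, 30]))
```

In this model no word is lost or duplicated even though the reader is slower than the writer, because the writer stalls until each word has been consumed.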

mattnrel · September 28, 2004

I'm migrating this thread to LAVA from the Info-LabVIEW list so that others can follow it...