Researchers Claim 1,000 Core Chip Created
eldavojohn writes "Remember a few months ago when the feasibility was discussed of a thousand core processor? By using FPGAs, Glasgow University researchers have claimed a proof of concept 1,000 core chip that they demonstrated running an MPEG algorithm at a speed of 5Gbps. From one of the researchers: 'This is very early proof-of-concept work where we're trying to demonstrate a convenient way to program FPGAs so that their potential to provide very fast processing power could be used much more widely in future computing and electronics. While many existing technologies currently make use of FPGAs, including plasma and LCD televisions and computer network routers, their use in standard desktop computers is limited. However, we are already seeing some microchips which combine traditional CPUs with FPGA chips being announced by developers, including Intel and ARM. I believe these kinds of processors will only become more common and help to speed up computers even further over the next few years.'"
How long will it be before we will see the first motherboards with FPGA emerge?
Then you can download the CPU type of your choice:
-- naah, I don't like this new Intel core, I will try the latest AMD instead...
or 1,000 logic blocks? Are they equivalent? Aren't FPGAs common and generally contain multiple logic blocks?
I still cannot find the droids I am looking for...
It may be too late, but perhaps someone could talk with Viva Computing, LLC who now owns the assets of Star Bridge Systems. It was not specified in the news release if they also own the intellectual property.
"Giving money and power to governments is like giving whiskey and car keys to teenage boys." - P.J. O'Rourke
This story was already submitted two times before eldavojon managed to get it to the front page in a little over an hour...
http://tech.slashdot.org/submission/1432844/University-of-Glasgow-pioneers-1000-core-processor
http://tech.slashdot.org/submission/1432512/1000-core-processors-
"Lame" - Galaxar
...to a paper that assumes that the reader already knows what a cpu is? This article is content-free.
Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
I think this is a great development. I've been using FPGAs in medical imaging for about 15 years. The groups that use the GPUs are getting great performance--definitely--but seeing as how MRI and CT machines are placed and need to run for 10, 15 20 years, I don't see how the GPUs will survive that time. One large OEM was pushing the GPUs for their architecture and I can't believe it will be successful if success is measured on the longevity scale. I'm sure the service sales guy will clean up.
Why do GPUs fail? I'm not sure of the exact modes of failure but the amount of heat has got to have something to do with it. FPGAs will run much cooler and in the FLOPS/Watt game, will win.
The story's been up for 20 minutes and no one's tried to imagine a Beowulf cluster of them yet? This is a great sadness.
The researchers then used the chip to process an algorithm which is central to the MPEG movie format – used in YouTube videos – at a speed of five gigabytes per second: around 20 times faster than current top-end desktop computers.
20x speed is getting closer to what I need before I can even ATTEMPT to build my very own Holodeck.
http://en.wikipedia.org/wiki/Holodeck
He who knows best knows how little he knows. - Thomas Jefferson
Yawn. Seriously.
(says the guy who does FPGA design for a living.)
We first need to break a lock of x86 instruction set and the operating system that requires it. CPUs already try to execute multiple x86 instructions in parallel, but this is severely limited by sequential instruction set design. There needs to be a way to express computation A and B using different sets of virtual registers and let hardware execute them sequentially or in parallel depending on its capabilities, or vectorize/parallelize multiple iterations of a loop. If software, including operating systems, is coded in higher level virtual machine bytecode interpreted by hypervisor, a lot of parallelism can be expressed for future use while still permitting efficient execution on current hardware. LLVM is a good start, although it needs a lot more concurrency/vectorization information to take advantage of coprocessors, GPUs and massively parallel architectures.
"However, we are already seeing some microchips which combine traditional CPUs with FPGA chips being announced by developers, including Intel and ARM."
welcome to 2004.
Xilinx Virtex II, includes internal PPC 405GP
A programmable hardware platform would provide amazing computing power because of hardware specialization: rather than emulating a proper CPU, you would download core architecture into the FPGA to accelerate tasks such as REGEX processing or H.264 decoding. You could compile the entire logic of a program into a gate array with various logical operators and flip-flop circuits for unlimited (albeit slow) registers (L2 registers) as well as including standard registers and SRAM cache (L1).
Although the FPGA runs slower than a regular CPU, direct programming rather than instructional programming (that is logic blocks that perform programmatic functions, rather than logic blocks that interpret discrete instructions to follow programmatic functions) would shorten the overall hardware logic path. In short, the chip would follow fewer clock cycles and instead just "do things." The CPU would be slow, but optimized for your workload. The main performance bottleneck would be the context switch: replacing the logic gate configuration with a new program every time you switch. Other than that, dynamic program expansion could be utilized: inlining operations like multiplication, addition, etc, or breaking them out if space constraints make it hard to load the whole program onto the FPGA that way.
The obvious, major issue we see is, of course, a security issue. You can now reprogram the CPU. This makes it difficult to prevent a program from bypassing any and all hardware security measures. This is solved by implementing a completely new security design on the chip, by which the CPU itself (the FPGA) is under control of external security mechanisms (paging etc handled in the MMU, outside the FPGA space, would largely mitigate most of this); it's not impossible to deal with, it's just an issue that needs to be raised.
In short, this sucks for "download the new Intel CPU into your BIOS/bootloader." This sucks for whatever general purpose CPU you can think of. For an entirely new programmatic platform, however, this would provide some interesting performance possibilities, and some interesting challenges.
Support my political activism on Patreon.
Now we need a chip that can take any given problem and divide it into one thousand parts so we can feed it into these processors. -Gives me a headache!
Not all problems map well to current GPU offerings. I have a problem that would benefit from parallel processing but due to a branchy algorithm and very random access for read/write, I can't really take advantage of GPU's to the extent some algorithms can (note: I have coded and run it on GPU's so this is more than just theory, additionally I have coded it to run on a network of computers and unfortunately the calc time vs network transmission time ratio for each cycle is not favorable enough for that to be a very good solution either, best solution is many cores accessing same memory).
For this particular problem, a large number of minimally functional "cpus" or "cores" would be ideal, some basic math, logic and branching. An FPGA is one way to try to achieve something like this.
Yes.
Tiller's Rule: Never use a word in written form that you've only heard and never read. You will end up looking foolish.
Software developers have barely figured out how to write single threaded algorithms without crashing. Now we are seeing more multithreaded algorithms with race conditions, deadlocks and other data-sharing bugs.
Can you imagine what will happen if every desktop machine has one or two FPGAs available for programs to use as needed?
PHB says "Hey, I've heard that you can make the program faster if you program custom hardware on the motherboard's FPGA. Get the new intern to write some FGPA code for our algorithms, and then re-write the module to use it. We'll ship it next month!"
Multicore processors has made software development an much more difficult, and putting a FPGA there will make it another two orders of magnitude more difficult. And programmers aren't getting smarter nearl as fast as the hardware is getting more complicated to program.
I hate it when I make a joke and I get modded "+5 insightful". Mod the stupid comments "funny", not "insightful", pleas
Ten years ago some young 6-digit ID Slashdotter was getting modded down for suggesting a Beowulf cluster of cores. Who's laughing now, mods?!?!?
I swear to God...I swear to God! That is NOT how you treat your human!
Without digging for any additional information, it bugs me that this chip has 1000 cores and not 1024.
What are the practical differences between targeting an FPGA on a computing platform and targeting more ubiquitous massively-parallel programmable pipelines in modern GPUs? Also, what are the fundamental differences? Could my GPU already contain FPGAs?
The main difference is that you don't program FPGAs. You do synchronous digital logic design which is implemented in the FPGA fabric. Thinking that you can program them like you program a sequential-execution processor is a recipe for failure. And, yeah, C-to-gates tools are a joke.
The ultimate end to this trend is to build a system that is just core processing logic, with logic and memory all fused as closely as possible. I call it the BitGrid... it consists of 4bit look up tables hooked into an orthogonal grid. Because every single table can be used simultaneously, there is no Von Neuman bottleneck to worry about.
Petaflops... here we come.... !
Someone had to say it, be kind.
Mine goes to 1011.
Proverbs 21:19
I think this is fantastic that a 1000-core processor is in development.
I hate to be the devil's advocate but at what point will Amdahl's Law take hold fully and adding more cores to a processor will prove to be a fruitless endeavor?