This paper presents a new architecture model, named Weld, for VLIW processors. Weld integrates multithreading support into a VLIW processor to hide run-time latency effects that
cannot be determined by the compiler. It does this through a novel hardware technique called operation welding that merges operations from different threads to utilize the hardware resources more efficiently. Hardware contexts such
as program counters and fetch units are duplicated to support multithreading. The experimental results show that the Weld
architecture attains a maximum of 27% speedup as compared to a single-threaded VLIW architecture. A MultiOp is a group of
instructions that can be potentially executed in parallel.