I found the root cause. It was because I was not setting the top 2 bits of the physical address of zero slot to 1 before sending it to VideoCore. Apparently the top 2 bits enable cache coherency. Thanks.
Statistics: Posted by kfcgoose — Thu Oct 03, 2024 11:22 pm