jump to navigation

Influence of downsampling filter characteristics on compression performance in wavelet-based scalable video coding September 23, 2008

Posted by whaldsz in : research , add a comment

The application of different downsampling filters in video coding directly models visual information at lower resolutions and influences the compression performance of a chosen coding system. In wavelet-based scalable video coding the spatial scalability is achieved by the application of wavelets as downsampling filters. However, characteristics of different wavelets influence the performance at targeting spatio-temporal decoding points. An analysis of different downsampling filters in popular wavelet-based scalable video coding schemes is presented. Evaluation is performed for both intra- and inter-coding schemes using wavelets and standard downsampling strategies. On the basis of the obtained results a new concept of inter-resolution prediction is proposed, which maximises the average performance using a combination of standard downsampling filters and wavelet-based coding.

More: continued here

Class-Based Feature Matching Across Unrestricted Transformations September 23, 2008

Posted by whaldsz in : research , add a comment

We develop a novel method for class-based feature matching across large changes in viewing conditions. The method is based on the property that when objects share a similar part, the similarity is preserved across viewing conditions. Given a feature and a training set of object images, we first identify the subset of objects that share this feature. The transformation of the feature’s appearance across viewing conditions is determined mainly by properties of the feature, rather than of the object in which it is embedded. Therefore, the transformed feature will be shared by approximately the same set of objects. Based on this consistency requirement, corresponding features can be reliably identified from a set of candidate matches. Unlike previous approaches, the proposed scheme compares feature appearances only in similar viewing conditions, rather than across different viewing conditions. As a result, the scheme is not restricted to locally planar objects or affine transformations. The approach also does not require examples of correct matches. We show that by using the proposed method, a dense set of accurate correspondences can be obtained. Experimental comparisons demonstrate that matching accuracy is significantly improved over previous schemes. Finally, we show that the scheme can be successfully used for invariant object recognition.

More: continued here

Design of Multimodal Dissimilarity Spaces for Retrieval of Video Documents September 23, 2008

Posted by whaldsz in : research , add a comment

This paper proposes a novel representation space for multimodal information, enabling fast and efficient retrieval of video data. We suggest describing the documents not directly by selected multimodal features (audio, visual or text), but rather by considering cross-document similarities relatively to their multimodal characteristics. This idea leads us to propose a particular form of emph{dissimilarity space} that is adapted to the asymmetric classification problem, and in turn to the emph{query-by-example} and emph{relevance feedback} paradigm, widely used in information retrieval. Based on the proposed dissimilarity space, we then define various strategies to fuse modalities through a kernel-based learning approach. The problem of automatic kernel setting to adapt the learning process to the queries is also discussed. The properties of our strategies are studied and validated on artificial data. In a second phase, a large annotated video corpus, (emph{ie} TRECVID-05), indexed by visual, audio and text features is considered to evaluate the overall performance of the dissimilarity space and fusion strategies. The obtained results confirm the validity of the proposed approach for the representation and retrieval of multimodal information in a real-time framework.

More: continued here

Caliper Software for Milled-Rice – Counts the number of grains, headrice and broken kernels September 14, 2008

Posted by whaldsz in : projects, software, technology , 1 comment so far

Milled rice software caliper This is an application for counting milled rice grains, determining the count of headrice, broken, and brewers. Using these grade factors, the caliper software (as I call it) is able to estimate the total weight in terms of percentage. Weight estimation of rice grain is performed using linear regression and support vector machines (SVM). It uses AForge library from AForge.NET for various image processing task.

I did this software for Bureau of Post-harvest Research Extension (BPRE) and indirectly, to National Food Authority (NFA) for the purpose of quickly determining the grain size of milled rice.

The demo and initial version of the caliper software is available for download here:

To use the software, simply install the application, then open the sample image included in the installation (“bigas.bmp”), then that’s it!  You can try it for other similar problems, like corn, barley, etc.  Please give me feedback if problems arises.  For any questions, you may email me at {vlad_crasher at yahoo point com}.

UPDATE: Yesterday (September 15, 2008), they told me that the test was successful!  The classification result did matched with the manual methods performed by human inspector.

New update for Google Chrome! September 6, 2008

Posted by whaldsz in : technology , add a comment

Update your Google Chrome now… old build is 0.2.149.27, the new build 0.2.149.29 is available!  Existing installation can be updated by clicking the Customize and control Google Chrome ~~> About Google Chrome.

chrome_newbuild

Also, if you are unaware of the new Google Chrome’s updated terms of service, read Mike Yang’s post on Google blog.