「研究進捗」カテゴリーアーカイブ

毎週の研究進捗の報告

週報(GAO)

2023年11月15日 GaoYuliang コメントする

The pruning frame has a BUG in the library, hard to make progress.

Ⅰ.learn the different varities of the IoU:

1.GIoU：https://openaccess.thecvf.com/content_CVPR_2019/html/Rezatofighi_Generalized_Intersection_Over_Union_A_Metric_and_a_Loss_for_CVPR_2019_paper.html

2.DIoU:

https://ojs.aaai.org/index.php/AAAI/article/view/6999

3.CIoU：

https://ieeexplore.ieee.org/abstract/document/9520717

4.MPDIoU：

研究進捗

週報(SUN YUYA)

2023年11月15日孙愉亚コメントする

This week I start conducting experiments about long term tracking. Except for the experiments, I still read some papers about long term object tracking.

The experiments are as follows: On the basis of a simple siamese tracker(SiamFC++), I modified it to a long term tracker，which needs a mechanism to judge whether the target object is absent. The most concise way is to exploit the confidence score. So I want to find a relationship between the score and IOU. The specific results are shown in the figure.

From the picture, we can see that the fluctuation trend of score is basically consistent with that of iou. But only relying on the score is not enough. So we may conduct experiments on the sequence of scores.

The new papers are as follows:

I change my mind that I should read more new papers instead of conducting experiments in haste. The specific analysis of papers are as follows:

1. Adaptive and Background-Aware Vision Transformer for Real-Time UAV Tracking

The paper’s purpose:

To slove the problem that traditional CNN is too slow.

Contributions:

(1) The paper proposes a framework, where feature learning and template-search coupling are integrated into an efficient one-stream ViT to avoid an extra heavy relation modeling module.

(2)The proposed Aba-ViT exploits an adaptive and background-aware token computation method to reduce inference time.

(3)This approach adaptively discards tokens based on learned halting probabilities, which a priori are higher for background tokens than target ones.

(4) Very Fast ! 180 fps!

Personal Evaluation:

Too fast! And the paper provide the code. We can let it act as main tracker.

2. Boosting UAV Tracking With Voxel-Based Trajectory-Aware Pre-Training

The paper’s purpose:

（1）To slove the problem that the siamese tracker was trapped when facing multiple views of object in consecutive frames.

（2）The general image-level pretrained backbone can overfit to holistic representations, causing the misalignment to learn object-level properties in UAV tracking.

Contributions:

(1) Fully exploit the stereoscopic representation for UAV tracking. Specifically, a novel pre-training paradigm method is proposed.

(2) Through trajectory-aware reconstruction training (TRT), the capability of the backbone to extract stereoscopic structure feature is strengthened without any parameter increment.

Personal Evaluation:

No code. The paper is related to 3D tracking.

3. Compact Transformer Tracker with Correlative Masked Modeling

The paper’s purpose:

(1) Proving that the traditional selfattention structure is sufficient for information aggregation, and structural adaption is unnecessary.

Contributions:

(1) The paper attaches a lightweight correlative masked decoder which reconstructs the original template and search image from the corresponding masked tokens.

(2) The structure is very simple.

Personal Evaluation:

Release code. The evaluation on benchmark is very high. Nice paper. But the analysis of network is beyond my ability.

4. Continuity-Aware Latent Interframe Information Mining for Reliable UAV Tracking

The paper’s purpose:

(1) Mainly focuses on explicit information to improve tracking performance, ignoring potential interframe connections.

Contributions:

(1) A network can generate highly-effective latent frame between two adjacent frames.

(2) Fully explore continuity-aware spatial-temporal information.

Personal Evaluation:

Release code. The innovation points are very innovative.

研究進捗

今週の進捗（根来）

2023年11月14日根来剛志コメントする

デンソーES提出

パナソニックES提出、Webテスト受験

ホンダOB個人面談、インターン面接

研究進捗

週報（白川）＃特許＃決定木結果＃大ゼミ

2023年11月14日白川智大コメントする

先生方からのご指摘

実際の血中アルコール濃度を検出しているわけではないので、検出→推定に変える。

アルコールインターロックシステムに使える推定方法なので、アルコールインターロックシステムの名前を絡ませる。もしくは、アルコールチェッカーよりも優位な点を示す。

生体パラメータは飲酒以外でも変化するので、飲酒状態を推定しているとは言い切れないのでは？→精度の高さで言い切る必要あり。

個人のためだけのモデルであれば、システムの補完に使えないのでは？→汎用化した時の精度を示す必要ある。もしくは運転者は誰かわかるので運転中にデータ採取、学習を行える方法を提案する必要あり

機械学習を用いた生体信号分析による飲酒状態推定法の有効性検証

とかにしようと思う

スマートウォッチと鍵を一体化したシステム→教師なし学習で二値分類→人の手を一度加えラベル付け→二度目から飲酒推定可能

類似研究：特許検索サイト

公開特許：脈拍、体温が閾値を超えているか（特開2023-061171）→運転開始時からの変化量？

生体信号を非侵襲で測定する特許が出願されている→これにより生体信号の測定方法で突っ込まれても反論できる。

研究進捗

週報(野崎)

2023年11月14日野崎愛斗コメントする

動画で中心窩レンダリングと物体検出を別々で行いました。大ゼミで指摘されたパワポの部分を直していました。

研究進捗

週報(豊永)

2023年11月14日豊永コメントする

研究テーマを変えようか迷っています。

研究進捗

飯田(週報)

2023年11月14日飯田コメントする

tensorflowではモデルの読み出しができなかったため、kerasを使用したところ読み出しに成功した。先週は、1匹分のモデルを作成し、識別を実行したが、犬の画像を猫と判別してしまったため、プログラムを見直す必要がある。

研究進捗

週報(西元)

2023年11月14日西元風音コメントする

大ゼミの発表のためのスライド作成をしました。

遠い人でも、認識できるように超解像について調べています。最初は一枚の画像を超解像してみようと思います。 RealESRGANというモデルが超解像するときに画像のぼやけが少なそうなので試してみます。一枚の画像からでは入力情報が足りなそうな場合は、難しそうですが一定時間の動画から複数枚の画像に分けて入力して超解像してみます。

研究進捗

今週の進捗(藤崎)

2023年11月14日藤崎雄仁コメントする

京セラ、NTT西日本、村田製作所に応募

~~申し訳ありませんが、自動車免許取得のため本日のゼミは欠席します。~~

意外と早く終わったので出れそうです。

研究進捗

WEEKLY REPORT (QI)

2023年11月9日戚子扬コメントする

First run the CenterPoint (3-D MOT) algorithm successfully

It took me about 6 days to establish the complete conda envoriment and run the Python algorithm of CenterPoint in my system (Ubuntu 20.04.6).

There is still a problem that the DCN part can be compiled, but it won’t run normally with my torch (1.12.0), an older version should be applied, but it will cause some other problems.

Problems records

not implemented for CPU ONLY build.

The CenterPoint contains two third-part libs, git clone cannot download both of them. They should be cloned desperately. And then, re-adding the CUDA path, CenterPoint path and third-part lib path to .bashrc. Definitely, one of libs can be installed by pip without compiling…… After that, this problem was solved.

2. RuntimeError:

———————————–File“/home/omaqzy/PycharmProjects/CenterPoint/det3d/ops/dcn/deform_conv.py”, line 93, in backward cur_im2col_step)

RuntimeError: view size is not compatible with input tensor’s size and stride (at least one dimension spans across two contiguous subspaces). Use .reshape(…) instead.

————————————–

This is the problem about DCN. Replaced the AT_CHECK with TORCH_CHECK in file deform_conv_cuda.cpp and deform_pool_cuda.cpp, so that the DCN can be compiled. When I run it to train the model, I will face the above problem.

The train result:

NEXT… …

Solve the above problem.

Learn the 3D-Detection and try to modify the algorithm.

月	火	水	木	金	土	日
					1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28

九州工業大学　芹川・張・山脇・楊研究室

「研究進捗」カテゴリーアーカイブ

週報(GAO)

週報(SUN YUYA)

今週の進捗（根来）

週報（白川）＃特許＃決定木結果＃大ゼミ

先生方からのご指摘

類似研究：特許検索サイト

週報(野崎)

週報(豊永)

飯田(週報)

週報(西元)

今週の進捗(藤崎)

WEEKLY REPORT (QI)

First run the CenterPoint (3-D MOT) algorithm successfully

Problems records

NEXT… …

Stay Hungry, Stay Foolish!