Comparing Long Short-Term Memory and Graph Convolutional Network Models for Human Activity Recognition using WISDM Dataset

مؤلف

Ebrahim, Majeed Mohamed Hasan

وكيل مرتبط

Zeki, Ahmed M. , مشرف الرسالة العلمية

تاريخ النشر

2023

اللغة

الأنجليزية

مدى

[12], 98, [6] pages

الموضوع

Long-term memory -- Dissertations

Short-term memory -- Dissertations

Memory -- Data processing -- Dissertations

مكان المؤسسة

Sakhir, Bahrain

نوع الرسالة الجامعية

Thesis (Master)

الجهه المانحه

University of Bahrain, College of Science, Department of Postgraduate Programs

الملخص الإنجليزي

Abstract: The development of technology in computer hardware has made observing and analyzing daily performed activities an easy task. Therefore, human activity recognition systems have benefited from the embedded sensors in smart devices in several fields, such as healthcare, security, and fitness. Furthermore, these sensors can generate sequenced data with temporal and spatial relationships. Therefore, Artificial Intelligence techniques, such as machine learning and deep learning, have been utilized for data processing and activity classification. While traditional machine learning methods have performed well, they require manual feature extraction from the raw data. Meanwhile, deep learning methods can extract the required features respecting spatial and temporal dependencies. Among those methods, The Long Short-Term Memory (LSTM) method is proven efficient in dealing with time-series data. In addition, the Graph Convolutional Network (GCN) has the ability to extract spatiotemporal relations using the graph data structure. However, as per author knowledge, the previous studies have yet to compare the performance of both methods, so this research aims to compare both methods using the WISDM dataset. The dataset contains data from accelerometers and gyroscope sensors embedded in smartphones and smartwatches for 18 performed activities. For each smart device, the data of two sensors have been combined to form two sub-datasets which are the preprocessed and prepared datasets for feeding models using the overlapped sliding windows. Hence, both methods have been applied to the resulting sub-datasets. As a result, for the smartphone sub-dataset, the GCN model outperformed the LSTM model by around 4%, scoring 94.4% for accuracy and F1-score. In addition, the GCN model performed better than some previous LSTM models considering a subset of activities. On the other hand, the smartwatch sub-dataset gave a slight advantage to the GCN model over the LSTM model by nearly 1%, as it scored 90.4% for accuracy and F1-score. However, the GCN method can be considered a rival to the LSTM method in such data type, which should be applied to other datasets in future works for generalization.

الملخص العربي

الملخص:

أدى تطوير التكنولوجيا في أجهزة الحاسوب إلى جعل مراقبة أنشطة الإنسان اليومية وتحليلها م همة سهلة. لذل ك
أفادت أنظمة التعرف على النشاط البشري من المستشعرات المدمجة في الأجهزة الذكية في عدة مجالات، مثل
الرعاية الصحية والأمن واللياقة البدنية. علاوة على ذلك، يمكن لهذه المستشعرات إنشاء بيانات متسلسلة ذات
علاقات زمنية ومكانية. لذلك، تم استخدام تقنيات الذكاء الاصطناعي، مثل التعلم الآلي والتعلم العميق، لمعالجة
البيانات وتصنيف النشاط. في حين أن طرق التعلم الآلي التقليدية قد حققت أداءً جيدًا، إلا أنها تتطلب استخراج
الميزات يدويًا من البيانات الأولية.
في الوقت نفسه، يمكن لأساليب التعلم العميق استخراج الميزات المطلوبة التي تراعي التبعيات المكانية والزمانية.
من بين تلك الطرق، أثبتت طريقة الذاكرة طويلة المدى ) LSTM ( فعاليتها في التعامل مع بيانات السلاسل الزمنية.
بالإضافة إلى ذلك ، فإن الشبكة التلافيفية للرسم البياني ) GCN ( لديها القدرة على استخراج العلاقات الزمانية
المكانية باستخدام بنية بيانات الرسم البياني. ومع ذلك، على حد معرفة الباحث، لم تقارن الدراسات السابقة أداء كلتا
الطريقتين، لذلك يهدف هذا البحث إلى مقارنة كلتا الطريقتين باستخدام مجموعة بيانات WISDM . تحتوي
مجموعة البيانات على بيانات من مقاييس التسارع وأجهزة استشعار الجيروسكوب المضمنة في الهواتف الذكية
والساعات الذكية ل 18 نشاطًا منفذ ا .
لكل جهاز ذكي، تم دمج بيانات المستشعرين لتشكيل مجموعتي بيانات فرعيين تم تجهيزهما مسبقًا وإعدادهما لتغذية
النماذج باستخدام النوافذ المنزلقة المتداخلة. ومن ثم تم تطبيق كلتا الطريقتين على مجموعات البيانات الفرعية
الناتجة. نتيجة لذلك، بالنسبة لمجموعة البيانات الفرعية للهواتف الذكية، تفوق نموذج GCN على نموذج LSTM
بحوالي 4 ٪، مسجلاً 94.4 ٪ من حيث الدقة ودرجة F1 . بالإضافة إلى ذلك، كان أداء نموذج GCN أفضل من
بعض نماذج LSTM السابقة مع الأخذ في الاعتبار مجموعة فرعية من الأنشطة. من ناحية أخرى، أعطت
مجموعة البيانات الفرعية للساعة الذكية ميزة طفيفة لنموذج GCN على نموذج LSTM بنسبة 1٪ تقريبًا، حيث
سجلت 90.4 ٪ من حيث الدقة ودرجة F1 . ومع ذلك، يمكن اعتبار طريقة GCN منافسة لطريقة LSTM في هذ ا
النوع من البيانات، والتي يجب تطبيقها على مجموعات البيانات الأخرى في الأعمال المستقبلية للتعميم.

ملاحظة

Title on Cover:
مقارنة نموذجي الذاكرة طويلة المدى والشبكة التلافيفية للرسم البياني للتعرف على النشاط البشري

المجموعة

College of Science

المعرف

https://digitalrepository.uob.edu.bh/id/007dc6d2-03c8-4856-afd3-303f939e81ec

مواد أخرى لنفس المؤلف

أطروحات

Comparing Long Short-Term Memory and Graph Convolutional Network Models for Human Activity Recognition using WISDM Dataset

Ebrahim, Majeed Mohamed Hasan

2023

مواد أخرى لنفس الموضوع

أطروحات

Comparing Long Short-Term Memory and Graph Convolutional Network Models for Human Activity Recognition using WISDM Dataset

Ebrahim, Majeed Mohamed Hasan

2023